Antisense modulation of NAC expression

ABSTRACT

Antisense compounds, compositions and methods are provided for modulating the expression of NAC. The compositions comprise antisense compounds, particularly antisense oligonucleotides, targeted to nucleic acids encoding NAC. Methods of using these compounds for modulation of NAC expression and for treatment of diseases associated with expression of NAC are provided.

FIELD OF THE INVENTION

[0001] The present invention provides compositions and methods for modulating the expression of NAC. In particular, this invention relates to compounds, particularly oligonucleotides, specifically hybridizable with nucleic acids encoding NAC. Such compounds have been shown to modulate the expression of NAC.

BACKGROUND OF THE INVENTION

[0002] Apoptosis, or programmed cell death, is a naturally occurring process that has been strongly conserved during evolution to prevent uncontrolled cell proliferation. This form of cell suicide plays a crucial role in ensuring the development and maintenance of multicellular organisms by eliminating superfluous or unwanted cells. However, if this process becomes overstimulated, cell loss and degenerative disorders including neurological disorders such as Alzheimers, Parkinsons, ALS, retinitis pigmentosa and blood cell disorders can result. Stimuli which can trigger apoptosis include growth factors such as tumor necrosis factor (TNF), Fas and transforming growth factor beta (TGFβ), neurotransmitters, growth factor withdrawal, loss of extracellular matrix attachment and extreme fluctuations in intracellular calcium levels (Afford and Randhawa, Mol. Pathol., 2000, 53, 55-63).

[0003] Alternatively, insufficient apoptosis, triggered by growth factors, extracellular matrix changes, CD40 ligand, viral gene products neutral amino acids, zinc, estrogen and androgens, can contribute to the development of cancer, autoimmune disorders and viral infections (Afford and Randhawa, Mol. Pathol., 2000, 53, 55-63). Consequently, apoptosis is regulated under normal circumstances by the interaction of gene products that either induce or inhibit cell death and several gene products that modulate the apoptotic process have now been identified.

[0004] The most well characterized apoptotic-signaling cascade to date is orchestrated by a family of cysteine proteases known as caspases. These enzymes activate apoptosis through proteolytic events triggered by one of several described mechanisms; including ligand binding to the cell surface death receptors of either the TNF or NGF receptor families, changes in mitochondrial integrity or chemical induction (Thornberry, British Medical Bulletin, 1997, 53, 478-490).

[0005] The proteins that control caspase activation pathways often exist as families that can be recognized based on their amino acid sequence and/or structural similarity. Moreover, interactions among these proteins are commonly mediated by domains that are intimately associated with apoptosis regulation including caspase-associated recruitment domains (CARDs), death domains (DDs) and death effector domains (DEDs) (Reed, Am. J. Pathol., 2000, 157, 1415-1430).

[0006] The overall structure of the CARD is comprised of six alpha helices (Reed, Am. J. Pathol., 2000, 157, 1415-1430). Homotypic interactions among CARD-carrying proteins play important roles in caspase activation throughout evolution (Reed, Am. J. Pathol., 2000, 157, 1415-1430). Several pro-caspases contain N-terminal CARDs in their pro-domains including caspases 1, 2, 4, 5 and 9 in humans and caspases 1, 2, 9, 11 and 12 in mice (Reed, Am. J. Pathol., 2000, 157, 1415-1430).

[0007] The members of the Apaf1/CED4 family play central roles in apoptosis regulation as activators of caspase family cell death proteases. NAC is a recently discovered Apaf1/CED4 family member, which is also known as death effector filament-forming CED4-like apoptosis protein (DEFCAP), NALP1 and CARD7. It contains a nucleotide-binding domain, a leucine-rich repeat region and a CARD which interacts selectively with the CARD domains of Apaf1, caspase-2 and caspase-9 (Chu et al., J. Biol. Chem., 2001, 276, 9239-9245; Hlaing et al., J. Biol. Chem., 2001, 276, 9230-9238).

[0008] NAC was cloned and mapped to chromosome 17p13 in 2001 (Chu et al., J. Biol. Chem., 2001, 276, 9239-9245; Hlaing et al., J. Biol. Chem., 2001, 276, 9230-9238). It is widely expressed in human tissues with the highest levels found in peripheral blood leukocytes, thymus, spleen and heart (Chu et al., J. Biol. Chem., 2001, 276, 9239-9245; Hlaing et al., J. Biol. Chem., 2001, 276, 9230-9238). It is also highly expressed in a chronic myelogenous leukemia cell line (Hlaing et al., J. Biol. Chem., 2001, 276, 9230-9238). NAC expression in vivo is associated with terminal differentiation of short-lived cells (Chu et al., J. Biol. Chem., 2001, 276, 9239-9245).

[0009] Martinon et al. determined that NAC contains a pyrin domain which is homologous to the product of the MEFV gene (Martinon et al., Curr. Biol., 2001, 11, R118-120).

[0010] Hlaing et al. reported two alternatively spliced isoforms of NAC, designated DEFCAP-L and DEFCAP-S that differ in 44 amino acids. Both isoforms are capable of interacting with caspase-2 and caspase-9, however, overexpression of DEFCAP-L, but not DEFCAP-S in breast adenocarcinoma cells results in significant levels of apoptosis (Hlaing et al., J. Biol. Chem., 2001, 276, 9230-9238).

[0011] Genbank accession numbers AF229059, AF229060, AF229061 and AF229062 represent four splice variants of NAC, denoted NAC-alpha, NAC-beta, NAC-gamma and NAC-delta, respectively.

[0012] Disclosed and claimed in PCT publication WO 01/00826 are isolated nucleic acid sequences encoding NAC, vectors containing said nucleic acid sequences and an antisense nucleic acid capable of specifically binding to said nucleic acid (Reed, 2001).

[0013] In HEK293 cells, which contain little endogenous NAC, overexpression of NAC by transient transfection promotes formation of a large apoptosome upon cytochrome c stimulation. A catalytic antisense DNAzyme targeting the translation initiation site of NAC mRNA was used to reduce levels of NAC which, in turn, decreased the ability of cytochrome c to activate caspases in cell extracts in vitro. A non-catalytic control oligonucleotide differing by a single base was inactive (Chu et al., J. Biol. Chem., 2001, 276, 9239-9245).

[0014] Currently, there exists a need to identify methods of modulating apoptosis for the therapeutic treatment of human diseases and it is believed that agents capable of modulating the expression of regulatory proteins and adaptor proteins involved in caspase apoptosis signaling cascades will be integral to these methods.

[0015] Currently, there are no known therapeutic agents that effectively inhibit the synthesis of NAC. Consequently, there is a need for agents capable of effectively inhibiting NAC function.

[0016] Antisense technology is emerging as an effective means for reducing the expression of specific gene products and may therefore prove to be uniquely useful in a number of therapeutic, diagnostic, and research applications for the modulation of gene expression and cellular processes.

[0017] The present invention provides compositions and methods for modulating NAC expression, and modulation of alternatively-spliced isoforms of NAC.

SUMMARY OF THE INVENTION

[0018] The present invention is directed to compounds, particularly antisense oligonucleotides, which are targeted to a nucleic acid encoding NAC, and which modulate the expression of NAC. Pharmaceutical and other compositions comprising the compounds of the invention are also provided. Further provided are methods of modulating the expression of NAC in cells or tissues comprising contacting said cells or tissues with one or more of the antisense compounds or compositions of the invention. Further provided are methods of treating an animal, particularly a human, suspected of having or being prone to a disease or condition associated with expression of NAC by administering a therapeutically or prophylactically effective amount of one or more of the antisense compounds or compositions of the invention.

DETAILED DESCRIPTION OF THE INVENTION

[0019] The present invention employs oligomeric compounds, particularly antisense oligonucleotides, for use in modulating the function of nucleic acid molecules encoding NAC, ultimately modulating the amount of NAC produced. This is accomplished by providing antisense compounds which specifically hybridize with one or more nucleic acids encoding NAC. As used herein, the terms “target nucleic acid” and “nucleic acid encoding NAC” encompass DNA encoding NAC, RNA (including pre-mRNA and mRNA) transcribed from such DNA, and also cDNA derived from such RNA. The specific hybridization of an oligomeric compound with its target nucleic acid interferes with the normal function of the nucleic acid. This modulation of function of a target nucleic acid by compounds which specifically hybridize to it is generally referred to as “antisense”. The functions of DNA to be interfered with include replication and transcription. The functions of RNA to be interfered with include all vital functions such as, for example, translocation of the RNA to the site of protein translation, translation of protein from the RNA, splicing of the RNA to yield one or more mRNA species, and catalytic activity which may be engaged in or facilitated by the RNA. The overall effect of such interference with target nucleic acid function is modulation of the expression of NAC. In the context of the present invention, “modulation” means either an increase (stimulation) or a decrease (inhibition) in the expression of a gene. In the context of the present invention, inhibition is the preferred form of modulation of gene expression and mRNA is a preferred target.

[0020] It is preferred to target specific nucleic acids for antisense. “Targeting” an antisense compound to a particular nucleic acid, in the context of this invention, is a multistep process. The process usually begins with the identification of a nucleic acid sequence whose function is to be modulated. This may be, for example, a cellular gene (or mRNA transcribed from the gene) whose expression is associated with a particular disorder or disease state, or a nucleic acid molecule from an infectious agent. In the present invention, the target is a nucleic acid molecule encoding NAC. The targeting process also includes determination of a site or sites within this gene for the antisense interaction to occur such that the desired effect, e.g., detection or modulation of expression of the protein, will result. Within the context of the present invention, a preferred intragenic site is the region encompassing the translation initiation or termination codon of the open reading frame (ORF) of the gene. Since, as is known in the art, the translation initiation codon is typically 5′-AUG (in transcribed mRNA molecules; 5′-ATG in the corresponding DNA molecule), the translation initiation codon is also referred to as the “AUG codon,” the “start codon” or the “AUG start codon”. A minority of genes have a translation initiation codon having the RNA sequence 5′-GUG, 5′-UUG or 5′-CUG, and 5′-AUA, 5′-ACG and 5′-CUG have been shown to function in vivo. Thus, the terms “translation initiation codon” and “start codon” can encompass many codon sequences, even though the initiator amino acid in each instance is typically methionine (in eukaryotes) or formylmethionine (in prokaryotes). It is also known in the art that eukaryotic and prokaryotic genes may have two or more alternative start codons, any one of which may be preferentially utilized for translation initiation in a particular cell type or tissue, or under a particular set of conditions. In the context of the invention, “start codon” and “translation initiation codon” refer to the codon or codons that are used in vivo to initiate translation of an mRNA molecule transcribed from a gene encoding NAC, regardless of the sequence(s) of such codons.

[0021] It is also known in the art that a translation termination codon (or “stop codon”) of a gene may have one of three sequences, i.e., 5′-UAA, 5′-UAG and 5′-UGA (the corresponding DNA sequences are 5′-TAA, 5′-TAG and 5′-TGA, respectively). The terms “start codon region” and “translation initiation codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation initiation codon. Similarly, the terms “stop codon region” and “translation termination codon region” refer to a portion of such an mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in either direction (i.e., 5′ or 3′) from a translation termination codon.

[0022] The open reading frame (ORF) or “coding region,” which is known in the art to refer to the region between the translation initiation codon and the translation termination codon, is also a region which may be targeted effectively. Other target regions include the 5′ untranslated region (5′UTR), known in the art to refer to the portion of an mRNA in the 5′ direction from the translation initiation codon, and thus including nucleotides between the 5′ cap site and the translation initiation codon of an mRNA or corresponding nucleotides on the gene, and the 3′ untranslated region (3′UTR), known in the art to refer to the portion of an mRNA in the 3′ direction from the translation termination codon, and thus including nucleotides between the translation termination codon and 3′ end of an mRNA or corresponding nucleotides on the gene. The 5′ cap of an mRNA comprises an N7-methylated guanosine residue joined to the 5′-most residue of the mRNA via a 5′-5′ triphosphate linkage. The 5′ cap region of an mRNA is considered to include the 5′ cap structure itself as well as the first 50 nucleotides adjacent to the cap. The 5′ cap region may also be a preferred target region.

[0023] Although some eukaryotic mRNA transcripts are directly translated, many contain one or more regions, known as “introns,” which are excised from a transcript before it is translated. The remaining (and therefore translated) regions are known as “exons” and are spliced together to form a continuous mRNA sequence. mRNA splice sites, i.e., intron-exon junctions, may also be preferred target regions, and are particularly useful in situations where aberrant splicing is implicated in disease, or where an overproduction of a particular mRNA splice product is implicated in disease. Aberrant fusion junctions due to rearrangements or deletions are also preferred targets. It has also been found that introns can also be effective, and therefore preferred, target regions for antisense compounds targeted, for example, to DNA or pre-mRNA.

[0024] Once one or more target sites have been identified, oligonucleotides are chosen which are sufficiently complementary to the target, i.e., hybridize sufficiently well and with sufficient specificity, to give the desired effect.

[0025] In the context of this invention, “hybridization” means hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleoside or nucleotide bases. For example, adenine and thymine are complementary nucleobases which pair through the formation of hydrogen bonds. “Complementary,” as used herein, refers to the capacity for precise pairing between two nucleotides. For example, if a nucleotide at a certain position of an oligonucleotide is capable of hydrogen bonding with a nucleotide at the same position of a DNA or RNA molecule, then the oligonucleotide and the DNA or RNA are considered to be complementary to each other at that position. The oligonucleotide and the DNA or RNA are complementary to each other when a sufficient number of corresponding positions in each molecule are occupied by nucleotides which can hydrogen bond with each other. Thus, “specifically hybridizable” and “complementary” are terms which are used to indicate a sufficient degree of complementarity or precise pairing such that stable and specific binding occurs between the oligonucleotide and the DNA or RNA target. It is understood in the art that the sequence of an antisense compound need not be 100% complementary to that of its target nucleic acid to be specifically hybridizable. An antisense compound is specifically hybridizable when binding of the compound to the target DNA or RNA molecule interferes with the normal function of the target DNA or RNA to cause a loss of utility, and there is a sufficient degree of complementarity to avoid non-specific binding of the antisense compound to non-target sequences under conditions in which specific binding is desired, i.e., under physiological conditions in the case of in vivo assays or therapeutic treatment, and in the case of in vitro assays, under conditions in which the assays are performed.

[0026] Antisense and other compounds of the invention which hybridize to the target and inhibit expression of the target are identified through experimentation, and the sequences of these compounds are hereinbelow identified as preferred embodiments of the invention. The target sites to which these preferred sequences are complementary are hereinbelow referred to as “active sites” and are therefore preferred sites for targeting. Therefore another embodiment of the invention encompasses compounds which hybridize to these active sites.

[0027] Antisense compounds are commonly used as research reagents and diagnostics. For example, antisense oligonucleotides, which are able to inhibit gene expression with exquisite specificity, are often used by those of ordinary skill to elucidate the function of particular genes. Antisense compounds are also used, for example, to distinguish between functions of various members of a biological pathway. Antisense modulation has, therefore, been harnessed for research use.

[0028] For use in kits and diagnostics, the antisense compounds of the present invention, either alone or in combination with other antisense compounds or therapeutics, can be used as tools in differential and/or combinatorial analyses to elucidate expression patterns of a portion or the entire complement of genes expressed within cells and tissues.

[0029] Expression patterns within cells or tissues treated with one or more antisense compounds are compared to control cells or tissues not treated with antisense compounds and the patterns produced are analyzed for differential levels of gene expression as they pertain, for example, to disease association, signaling pathway, cellular localization, expression level, size, structure or function of the genes examined. These analyses can be performed on stimulated or unstimulated cells and in the presence or absence of other compounds which affect expression patterns.

[0030] Examples of methods of gene expression analysis known in the art include DNA arrays or microarrays (Brazma and Vilo, FEBS Lett., 2000, 480, 17-24; Celis, et al., FEBS Lett., 2000, 480, 2-16), SAGE (serial analysis of gene expression)(Madden, et al., Drug Discov. Today, 2000, 5, 415-425), READS (restriction enzyme amplification of digested cDNAs) (Prashar and Weissman, Methods Enzymol., 1999, 303, 258-72), TOGA (total gene expression analysis) (Sutcliffe, et al., Proc. Natl. Acad. Sci. U. S. A., 2000, 97, 1976-81), protein arrays and proteomics (Celis, et al., FEBS Lett., 2000, 480, 2-16; Jungblut, et al., Electrophoresis, 1999, 20, 2100-10), expressed sequence tag (EST) sequencing (Celis, et al., FEBS Lett., 2000, 480, 2-16; Larsson, et al., J. Biotechnol., 2000, 80, 143-57), subtractive RNA fingerprinting (SuRF) (Fuchs, et al., Anal. Biochem., 2000, 286, 91-98; Larson, et al., Cytometry, 2000, 41, 203-208), subtractive cloning, differential display (DD) (Jurecic and Belmont, Curr. Opin. Microbiol., 2000, 3, 316-21), comparative genomic hybridization (Carulli, et al., J. Cell Biochem. Suppl., 1998, 31, 286-96), FISH (fluorescent in situ hybridization) techniques (Going and Gusterson, Eur. J. Cancer, 1999, 35, 1895-904) and mass spectrometry methods (reviewed in (To, Comb. Chem. High Throughput Screen, 2000, 3, 235-41).

[0031] The specificity and sensitivity of antisense is also harnessed by those of skill in the art for therapeutic uses. Antisense oligonucleotides have been employed as therapeutic moieties in the treatment of disease states in animals and man. Antisense oligonucleotide drugs, including ribozymes, have been safely and effectively administered to humans and numerous clinical trials are presently underway. It is thus established that oligonucleotides can be useful therapeutic modalities that can be configured to be useful in treatment regimes for treatment of cells, tissues and animals, especially humans.

[0032] In the context of this invention, the term “oligonucleotide” refers to an oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or mimetics thereof. This term includes oligonucleotides composed of naturally-occurring nucleobases, sugars and covalent internucleoside (backbone) linkages as well as oligonucleotides having non-naturally-occurring portions which function similarly. Such modified or substituted oligonucleotides are often preferred over native forms because of desirable properties such as, for example, enhanced cellular uptake, enhanced affinity for nucleic acid target and increased stability in the presence of nucleases.

[0033] While antisense oligonucleotides are a preferred form of antisense compound, the present invention comprehends other oligomeric antisense compounds, including but not limited to oligonucleotide mimetics such as are described below. The antisense compounds in accordance with this invention preferably comprise from about 8 to about 50 nucleobases (i.e. from about 8 to about 50 linked nucleosides). Particularly preferred antisense compounds are antisense oligonucleotides, even more preferably those comprising from about 12 to about 30 nucleobases. Antisense compounds include ribozymes, external guide sequence (EGS) oligonucleotides (oligozymes), and other short catalytic RNAs or catalytic oligonucleotides which hybridize to the target nucleic acid and modulate its expression.

[0034] As is known in the art, a nucleoside is a base-sugar combination. The base portion of the nucleoside is normally a heterocyclic base. The two most common classes of such heterocyclic bases are the purines and the pyrimidines. Nucleotides are nucleosides that further include a phosphate group covalently linked to the sugar portion of the nucleoside. For those nucleosides that include a pentofuranosyl sugar, the phosphate group can be linked to either the 2′, 3′ or 5′ hydroxyl moiety of the sugar. In forming oligonucleotides, the phosphate groups covalently link adjacent nucleosides to one another to form a linear polymeric compound. In turn the respective ends of this linear polymeric structure can be further joined to form a circular structure, however, open linear structures are generally preferred. Within the oligonucleotide structure, the phosphate groups are commonly referred to as forming the internucleoside backbone of the oligonucleotide. The normal linkage or backbone of RNA and DNA is a 3′ to 5′ phosphodiester linkage.

[0035] Specific examples of preferred antisense compounds useful in this invention include oligonucleotides containing modified backbones or non-natural internucleoside linkages. As defined in this specification, oligonucleotides having modified backbones include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone. For the purposes of this specification, and as sometimes referenced in the art, modified oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can also be considered to be oligonucleosides.

[0036] Preferred modified oligonucleotide backbones include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3′-alkylene phosphonates, 5′-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates and boranophosphates having normal 3′-5′ linkages, 2′-5′ linked analogs of these, and those having inverted polarity wherein one or more internucleotide linkages is a 3′ to 3′, 5′ to 5′ or 2′ to 2′ linkage. Preferred oligonucleotides having inverted polarity comprise a single 3′ to 3′ linkage at the 3′-most internucleotide linkage i.e. a single inverted nucleoside residue which may be a basic (the nucleobase is missing or has a hydroxyl group in place thereof). Various salts, mixed salts and free acid forms are also included.

[0037] Representative United States patents that teach the preparation of the above phosphorus-containing linkages include, but are not limited to, U.S. Pat. Nos. 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; 5,194,599; 5,565,555; 5,527,899; 5,721,218; 5,672,697 and 5,625,050, certain of which are commonly owned with this application, and each of which is herein incorporated by reference.

[0038] Preferred modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; riboacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH₂ component parts.

[0039] Representative United States patents that teach the preparation of the above oligonucleosides include, but are not limited to, U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; 5,792,608; 5,646,269 and 5,677,439, certain of which are commonly owned with this application, and each of which is herein incorporated by reference.

[0040] In other preferred oligonucleotide mimetics, both the sugar and the internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization with an appropriate nucleic acid target compound. One such oligomeric compound, an oligonucleotide mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the sugar-backbone of an oligonucleotide is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative United States patents that teach the preparation of PNA compounds include, but are not limited to, U.S. Pat. Nos. 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference. Further teaching of PNA compounds can be found in Nielsen et al., Science, 1991, 254, 1497-1500.

[0041] Most preferred embodiments of the invention are oligonucleotides with phosphorothioate backbones and oligonucleosides with heteroatom backbones, and in particular —CH₂—NH—O—CH₂—, —CH₂—N(CH₃)—O—CH₂— [known as a methylene (methylimino) or MMI backbone], —CH₂—O—N(CH₃)—CH₂—, —CH₂—N(CH₃)—N(CH₃)—CH₂— and —O—N(CH₃)—CH₂—CH₂— [wherein the native phosphodiester backbone is represented as —O—P—O—CH₂—] of the above referenced U.S. Pat. No. 5,489,677, and the amide backbones of the above referenced U.S. Pat. No. 5,602,240. Also preferred are oligonucleotides having morpholino backbone structures of the above-referenced U.S. Pat. No. 5,034,506.

[0042] Modified oligonucleotides may also contain one or more substituted sugar moieties. Preferred oligonucleotides comprise one of the following at the 2′ position: OH; F; O—, S—, or N-alkyl; O—, S—, or N-alkenyl; O—, S— or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or unsubstituted C₁ to C₁₀ alkyl or C₂ to C₁₀ alkenyl and alkynyl. Particularly preferred are O[(CH₂)_(n)O]_(m)CH₃, O(CH₂)_(n)OCH₃, O(CH₂)_(n)NH₂, O(CH₂)_(n) CH₃, O(CH₂)_(n)ONH₂, and O(CH₂)_(n)ON[(CH₂)_(n)CH₃)]₂, where n and m are from 1 to about 10. Other preferred oligonucleotides comprise one of the following at the 2′ position: C₁ to C₁₀ lower alkyl, substituted lower alkyl, alkenyl, alkynyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH₃, OCN, Cl, Br, CN, CF₃, OCF₃, SOCH₃, SO₂CH₃, ONO₂, NO₂, N₃, NH₂, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. A preferred modification includes 2′-methoxyethoxy (2′-O—CH₂CH₂OCH₃, also known as 2′-O-(2-methoxyethyl) or 2′-MOE) (Martin et al., Helv. Chim. Acta, 1995, 78, 486-504) i.e., an alkoxyalkoxy group. A further preferred modification includes 2′-dimethylaminooxyethoxy, i.e., a O(CH₂)₂ON(CH₃)₂ group, also known as 2′-DMAOE, as described in examples hereinbelow, and 2′-dimethylaminoethoxyethoxy (also known in the art as 2′-O-dimethylaminoethoxyethyl or 2′-DMAEOE), i.e., 2′-O—CH₂—O—CH₂—N(CH₂)₂, also described in examples hereinbelow.

[0043] A further prefered modification includes Locked Nucleic Acids (LNAs) in which the 2′-hydroxyl group is linked to the 3′ or 4′ carbon atom of the sugar ring thereby forming a bicyclic sugar moiety. The linkage is preferably a methelyne (—CH₂—)_(n) group bridging the 2′ oxygen atom and the 4′ carbon atom wherein n is 1 or 2. LNAs and preparation thereof are described in WO 98/39352 and WO 99/14226.

[0044] Other preferred modifications include 2′-methoxy (2′-O—CH₃), 2′-aminopropoxy (2′-OCH₂CH₂CH₂NH₂), 2′-allyl (2′-CH₂—CH═CH₂), 2′-O-allyl (2′-O—CH₂—CH═CH₂) and 2′-fluoro (2′-F). The 2′-modification may be in the arabino (up) position or ribo (down) position. A preferred 2′-arabino modification is 2′-F. Similar modifications may also be made at other positions on the oligonucleotide, particularly the 3′ position of the sugar on the 3′ terminal nucleotide or in 2′-5′ linked oligonucleotides and the 5′ position of 5′ terminal nucleotide. Oligonucleotides may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. Representative United States patents that teach the preparation of such modified sugar structures include, but are not limited to, U.S. Pat. Nos. 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 5,658,873; 5,670,633; 5,792,747; and 5,700,920, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference in its entirety.

[0045] Oligonucleotides may also include nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl (—C≡C—CH₃) uracil and cytosine and other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 2-F-adenine, 2-amino-adenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further modified nucleobases include tricyclic pyrimidines such as phenoxazine cytidine(1H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), phenothiazine cytidine (1H-pyrimido[5,4-b][1,4]benzothiazin-2(3H)-one), G-clamps such as a substituted phenoxazine cytidine (e.g. 9-(2-aminoethoxy)-H-pyrimido[5,4-b][1,4]benzoxazin-2(3H)-one), carbazole cytidine (2H-pyrimido[4,5-b]indol-2-one), pyridoindole cytidine (H-pyrido[3′,2′:4,5]pyrrolo[2,3-d]pyrimidin-2-one). Modified nucleobases may also include those in which the purine or pyrimidine base is replaced with other heterocycles, for example 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine and 2-pyridone. Further nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in The Concise Encyclopedia Of Polymer Science And Engineering,pages 858-859, Kroschwitz, J. I., ed. John Wiley & Sons, 1990, those disclosed by Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B. ed., CRC Press, 1993. Certain of these nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds of the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O 6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2° C. (Sanghvi, Y. S., Crooke, S. T. and Lebleu, B., eds., Antisense Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are presently preferred base substitutions, even more particularly when combined with 2′-O-methoxyethyl sugar modifications.

[0046] Representative United States patents that teach the preparation of certain of the above noted modified nucleobases as well as other modified nucleobases include, but are not limited to, the above noted U.S. Pat. No. 3,687,808, as well as U.S. Pat. Nos. 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; 5,645,985; 5,830,653; 5,763,588; 6,005,096; and 5,681,941, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference, and U.S. Pat. No. 5,750,692, which is commonly owned with the instant application and also herein incorporated by reference.

[0047] Another modification of the oligonucleotides of the invention involves chemically linking to the oligonucleotide one or more moieties or conjugates which enhance the activity, cellular distribution or cellular uptake of the oligonucleotide. The compounds of the invention can include conjugate groups covalently bound to functional groups such as primary or secondary hydroxyl groups. Conjugate groups of the invention include intercalators, reporter molecules, polyamines, polyamides, polyethylene glycols, polyethers, groups that enhance the pharmacodynamic properties of oligomers, and groups that enhance the pharmacokinetic properties of oligomers. Typical conjugates groups include cholesterols, lipids, phospholipids, biotin, phenazine, folate, phenanthridine, anthraquinone, acridine, fluoresceins, rhodamines, coumarins, and dyes. Groups that enhance the pharmacodynamic properties, in the context of this invention, include groups that improve oligomer uptake, enhance oligomer resistance to degradation, and/or strengthen sequence-specific hybridization with RNA. Groups that enhance the pharmacokinetic properties, in the context of this invention, include groups that improve oligomer uptake, distribution, metabolism or excretion. Representative conjugate groups are disclosed in International Patent Application PCT/US92/09196, filed Oct. 23, 1992 the entire disclosure of which is incorporated herein by reference. Conjugate moieties include but are not limited to lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 86, 6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let., 1994, 4, 1053-1060), a thioether, e.g., hexyl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci., 1992, 660, 306-309; Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a thiocholesterol (Oberhauser et al., Nucl. Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., dodecandiol or undecyl residues (Saison-Behmoaras et al., EMBO J., 1991, 10, 1111-1118; Kabanov et al., FEBS Lett., 1990, 259, 327-330; Svinarchuk et al., Biochimie, 1993, 75, 49-54), a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethyl-ammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654; Shea et al., i Nucl. Acids Res., 1990, 18, 3777-3783), a polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 1264, 229-237), or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. Ther., 1996, 277, 923-937. Oligonucleotides of the invention may also be conjugated to active drug substances, for example, aspirin, warfarin, phenylbutazone, ibuprofen, suprofen, fenbufen, ketoprofen, (S)-(+)-pranoprofen, carprofen, dansylsarcosine, 2,3,5-triiodobenzoic acid, flufenamic acid, folinic acid, a benzothiadiazide, chlorothiazide, a diazepine, indomethicin, a barbiturate, a cephalosporin, a sulfa drug, an antidiabetic, an antibacterial or an antibiotic. Oligonucleotide-drug conjugates and their preparation are described in U.S. patent application Ser. No. 09/334,130 (filed Jun. 15, 1999) which is incorporated herein by reference in its entirety.

[0048] Representative United States patents that teach the preparation of such oligonucleotide conjugates include, but are not limited to, U.S. Pat. Nos. 4,828,979; 4,948,882; 5,218,105; 5,525,465; 5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731; 5,580,731; 5,591,584; 5,109,124; 5,118,802; 5,138,045; 5,414,077; 5,486,603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 4,762,779; 4,789,737; 4,824,941; 4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 5,112,963; 5,214,136; 5,082,830; 5,112,963; 5,214,136; 5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 5,317,098; 5,371,241, 5,391,723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 5,565,552; 5,567,810; 5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 5,599,928 and 5,688,941, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference.

[0049] It is not necessary for all positions in a given compound to be uniformly modified, and in fact more than one of the aforementioned modifications may be incorporated in a single compound or even at a single nucleoside within an oligonucleotide. The present invention also includes antisense compounds which are chimeric compounds. “Chimeric” antisense compounds or “chimeras,” in the context of this invention, are antisense compounds, particularly oligonucleotides, which contain two or more chemically distinct regions, each made up of at least one monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These oligonucleotides typically contain at least one region wherein the oligonucleotide is modified so as to confer upon the oligonucleotide increased resistance to nuclease degradation, increased cellular uptake, and/or increased binding affinity for the target nucleic acid. An additional region of the oligonucleotide may serve as a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids. By way of example, RNase H is a cellular endonuclease which cleaves the RNA strand of an RNA:DNA duplex. Activation of RNase H, therefore, results in cleavage of the RNA target, thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene expression. Consequently, comparable results can often be obtained with shorter oligonucleotides when chimeric oligonucleotides are used, compared to phosphorothioate deoxyoligonucleotides hybridizing to the same target region. Cleavage of the RNA target can be routinely detected by gel electrophoresis and, if necessary, associated nucleic acid hybridization techniques known in the art.

[0050] Chimeric antisense compounds of the invention may be formed as composite structures of two or more oligonucleotides, modified oligonucleotides, oligonucleosides and/or oligonucleotide mimetics as described above. Such compounds have also been referred to in the art as hybrids or gapmers. Representative United States patents that teach the preparation of such hybrid structures include, but are not limited to, U.S. Pat. Nos. 5,013,830; 5,149,797; 5,220,007; 5,256,775; 5,366,878; 5,403,711; 5,491,133; 5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, certain of which are commonly owned with the instant application, and each of which is herein incorporated by reference in its entirety.

[0051] The antisense compounds used in accordance with this invention may be conveniently and routinely made through the well-known technique of solid phase synthesis. Equipment for such synthesis is sold by several vendors including, for example, Applied Biosystems (Foster City, Calif.). Any other means for such synthesis known in the art may additionally or alternatively be employed. It is well known to use similar techniques to prepare oligonucleotides such as the phosphorothioates and alkylated derivatives.

[0052] The antisense compounds of the invention are synthesized in vitro and do not include antisense compositions of biological origin, or genetic vector constructs designed to direct the in vivo synthesis of antisense molecules. The compounds of the invention may also be admixed, encapsulated, conjugated or otherwise associated with other molecules, molecule structures or mixtures of compounds, as for example, liposomes, receptor targeted molecules, oral, rectal, topical or other formulations, for assisting in uptake, distribution and/or absorption. Representative United States patents that teach the preparation of such uptake, distribution and/or absorption assisting formulations include, but are not limited to, U.S. Pat. Nos. 5,108,921; 5,354,844; 5,416,016; 5,459,127; 5,521,291; 5,543,158; 5,547,932; 5,583,020; 5,591,721; 4,426,330; 4,534,899; 5,013,556; 5,108,921; 5,213,804; 5,227,170; 5,264,221; 5,356,633; 5,395,619; 5,416,016; 5,417,978; 5,462,854; 5,469,854; 5,512,295; 5,527,528; 5,534,259; 5,543,152; 5,556,948; 5,580,575; and 5,595,756, each of which is herein incorporated by reference.

[0053] The antisense compounds of the invention encompass any pharmaceutically acceptable salts, esters, or salts of such esters, or any other compound which, upon administration to an animal including a human, is capable of providing (directly or indirectly) the biologically active metabolite or residue thereof. Accordingly, for example, the disclosure is also drawn to prodrugs and pharmaceutically acceptable salts of the compounds of the invention, pharmaceutically acceptable salts of such prodrugs, and other bioequivalents.

[0054] The term “prodrug” indicates a therapeutic agent that is prepared in an inactive form that is converted to an active form (i.e., drug) within the body or cells thereof by the action of endogenous enzymes or other chemicals and/or conditions. In particular, prodrug versions of the oligonucleotides of the invention are prepared as SATE [(S-acetyl-2-thioethyl) phosphate] derivatives according to the methods disclosed in WO 93/24510 to Gosselin et al., published Dec. 9, 1993 or in WO 94/26764 and U.S. Pat No. 5,770,713 to Imbach et al.

[0055] The term “pharmaceutically acceptable salts” refers to physiologically and pharmaceutically acceptable salts of the compounds of the invention: i.e., salts that retain the desired biological activity of the parent compound and do not impart undesired toxicological effects thereto.

[0056] Pharmaceutically acceptable base addition salts are formed with metals or amines, such as alkali and alkaline earth metals or organic amines. Examples of metals used as cations are sodium, potassium, magnesium, calcium, and the like. Examples of suitable amines are N,N′-dibenzylethylenediamine, chloroprocaine, choline, diethanolamine, dicyclohexylamine, ethylenediamine, N-methylglucamine, and procaine (see, for example, Berge et al., “Pharmaceutical Salts,” J. of Pharma Sci., 1977, 66, 1-19). The base addition salts of said acidic compounds are prepared by contacting the free acid form with a sufficient amount of the desired base to produce the salt in the conventional manner. The free acid form may be regenerated by contacting the salt form with an acid and isolating the free acid in the conventional manner. The free acid forms differ from their respective salt forms somewhat in certain physical properties such as solubility in polar solvents, but otherwise the salts are equivalent to their respective free acid for purposes of the present invention. As used herein, a “pharmaceutical addition salt” includes a pharmaceutically acceptable salt of an acid form of one of the components of the compositions of the invention. These include organic or inorganic acid salts of the amines. Preferred acid salts are the hydrochlorides, acetates, salicylates, nitrates and phosphates. Other suitable pharmaceutically acceptable salts are well known to those skilled in the art and include basic salts of a variety of inorganic and organic acids, such as, for example, with inorganic acids, such as for example hydrochloric acid, hydrobromic acid, sulfuric acid or phosphoric acid; with organic carboxylic, sulfonic, sulfo or phospho acids or N-substituted sulfamic acids, for example acetic acid, propionic acid, glycolic acid, succinic acid, maleic acid, hydroxymaleic acid, methylmaleic acid, fumaric acid, malic acid, tartaric acid, lactic acid, oxalic acid, gluconic acid, glucaric acid, glucuronic acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, salicylic acid, 4-aminosalicylic acid, 2-phenoxybenzoic acid, 2-acetoxybenzoic acid, embonic acid, nicotinic acid or isonicotinic acid; and with amino acids, such as the 20 alpha-amino acids involved in the synthesis of proteins in nature, for example glutamic acid or aspartic acid, and also with phenylacetic acid, methanesulfonic acid, ethanesulfonic acid, 2-hydroxyethanesulfonic acid, ethane-1,2-disulfonic acid, benzenesulfonic acid, 4-methylbenzenesulfonic acid, naphthalene-2-sulfonic acid, naphthalene-1,5-disulfonic acid, 2- or 3-phosphoglycerate, glucose-6-phosphate, N-cyclohexylsulfamic acid (with the formation of cyclamates), or with other acid organic compounds, such as ascorbic acid. Pharmaceutically acceptable salts of compounds may also be prepared with a pharmaceutically acceptable cation. Suitable pharmaceutically acceptable cations are well known to those skilled in the art and include alkaline, alkaline earth, ammonium and quaternary ammonium cations. Carbonates or hydrogen carbonates are also possible.

[0057] For oligonucleotides, preferred examples of pharmaceutically acceptable salts include but are not limited to (a) salts formed with cations such as sodium, potassium, ammonium, magnesium, calcium, polyamines such as spermine and spermidine, etc.; (b) acid addition salts formed with inorganic acids, for example hydrochloric acid, hydrobromic acid, sulfuric acid, phosphoric acid, nitric acid and the like; (c) salts formed with organic acids such as, for example, acetic acid, oxalic acid, tartaric acid, succinic acid, maleic acid, fumaric acid, gluconic acid, citric acid, malic acid, ascorbic acid, benzoic acid, tannic acid, palmitic acid, alginic acid, polyglutamic acid, naphthalenesulfonic acid, methanesulfonic acid, p-toluenesulfonic acid, naphthalenedisulfonic acid, polygalacturonic acid, and the like; and (d) salts formed from elemental anions such as chlorine, bromine, and iodine.

[0058] The antisense compounds of the present invention can be utilized for diagnostics, therapeutics, prophylaxis and as research reagents and kits. For therapeutics, an animal, preferably a human, suspected of having a disease or disorder which can be treated by modulating the expression of NAC is treated by administering antisense compounds in accordance with this invention. The compounds of the invention can be utilized in pharmaceutical compositions by adding an effective amount of an antisense compound to a suitable pharmaceutically acceptable diluent or carrier. Use of the antisense compounds and methods of the invention may also be useful prophylactically, e.g., to prevent or delay infection, inflammation or tumor formation, for example.

[0059] The antisense compounds of the invention are useful for research and diagnostics, because these compounds hybridize to nucleic acids encoding NAC, enabling sandwich and other assays to easily be constructed to exploit this fact. Hybridization of the antisense oligonucleotides of the invention with a nucleic acid encoding NAC can be detected by means known in the art. Such means may include conjugation of an enzyme to the oligonucleotide, radiolabelling of the oligonucleotide or any other suitable detection means. Kits using such detection means for detecting the level of NAC in a sample may also be prepared.

[0060] The present invention also includes pharmaceutical compositions and formulations which include the antisense compounds of the invention. The pharmaceutical compositions of the present invention may be administered in a number of ways depending upon whether local or systemic treatment is desired and upon the area to be treated. Administration may be topical (including ophthalmic and to mucous membranes including vaginal and rectal delivery), pulmonary, e.g., by inhalation or insufflation of powders or aerosols, including by nebulizer; intratracheal, intranasal, epidermal and transdermal), oral or parenteral. Parenteral administration includes intravenous, intraarterial, subcutaneous, intraperitoneal or intramuscular injection or infusion; or intracranial, e.g., intrathecal or intraventricular, administration. Oligonucleotides with at least one 2′-O-methoxyethyl modification are believed to be particularly useful for oral administration.

[0061] Pharmaceutical compositions and formulations for topical administration may include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. Coated condoms, gloves and the like may also be useful. Preferred topical formulations include those in which the oligonucleotides of the invention are in admixture with a topical delivery agent such as lipids, liposomes, fatty acids, fatty acid esters, steroids, chelating agents and surfactants. Preferred lipids and liposomes include neutral (e.g. dioleoylphosphatidyl DOPE ethanolamine, dimyristoylphosphatidyl choline DMPC, distearolyphosphatidyl choline) negative (e.g. dimyristoylphosphatidyl glycerol DMPG) and cationic (e.g. dioleoyltetramethylaminopropyl DOTAP and dioleoylphosphatidyl ethanolamine DOTMA). Oligonucleotides of the invention may be encapsulated within liposomes or may form complexes thereto, in particular to cationic liposomes. Alternatively, oligonucleotides may be complexed to lipids, in particular to cationic lipids. Preferred fatty acids and esters include but are not limited arachidonic acid, oleic acid, eicosanoic acid, lauric acid, caprylic acid, capric acid, myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein, dilaurin, glyceryl 1-monocaprate, 1-dodecylazacycloheptan-2-one, an acylcarnitine, an acylcholine, or a C₁₋₁₀ alkyl ester (e.g. isopropylmyristate IPM), monoglyceride, diglyceride or pharmaceutically acceptable salt thereof. Topical formulations are described in detail in U.S. patent application Ser. No. 09/315,298 filed on May 20, 1999 which is incorporated herein by reference in its entirety.

[0062] Compositions and formulations for oral administration include powders or granules, microparticulates, nanoparticulates, suspensions or solutions in water or non-aqueous media, capsules, gel capsules, sachets, tablets or minitablets. Thickeners, flavoring agents, diluents, emulsifiers, dispersing aids or binders may be desirable. Preferred oral formulations are those in which oligonucleotides of the invention are administered in conjunction with one or more penetration enhancers surfactants and chelators. Preferred surfactants include fatty acids and/or esters or salts thereof, bile acids and/or salts thereof. Prefered bile acids/salts include chenodeoxycholic acid (CDCA) and ursodeoxychenodeoxycholic acid (UDCA), cholic acid, dehydrocholic acid, deoxycholic acid, glucholic acid, glycholic acid, glycodeoxycholic acid, taurocholic acid, taurodeoxycholic acid, sodium tauro-24,25-dihydro-fusidate, sodium glycodihydrofusidate,. Prefered fatty acids include arachidonic acid, undecanoic acid, oleic acid, lauric acid, caprylic acid, capric acid, myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein, dilaurin, glyceryl 1-monocaprate, 1-dodecylazacycloheptan-2-one, an acylcarnitine, an acylcholine, or a monoglyceride, a diglyceride or a pharmaceutically acceptable salt thereof (e.g. sodium). Also prefered are combinations of penetration enhancers, for example, fatty acids/salts in combination with bile acids/salts. A particularly prefered combination is the sodium salt of lauric acid, capric acid and UDCA. Further penetration enhancers include polyoxyethylene-9-lauryl ether, polyoxyethylene-20-cetyl ether. Oligonucleotides of the invention may be delivered orally in granular form including sprayed dried particles, or complexed to form micro or nanoparticles. Oligonucleotide complexing agents include poly-amino acids; polyimines; polyacrylates; polyalkylacrylates, polyoxethanes, polyalkylcyanoacrylates; cationized gelatins, albumins, starches, acrylates, polyethyleneglycols (PEG) and starches; polyalkylcyanoacrylates; DEAE-derivatized polyimines, pollulans, celluloses and starches. Particularly preferred complexing agents include chitosan, N-trimethylchitosan, poly-L-lysine, polyhistidine, polyornithine, polyspermines, protamine, polyvinylpyridine, polythiodiethylamino-methylethylene P(TDAE), polyaminostyrene (e.g. p-amino), poly(methylcyanoacrylate), poly(ethylcyanoacrylate), poly(butylcyanoacrylate), poly(isobutylcyanoacrylate), poly(isohexylcynaoacrylate), DEAE-methacrylate, DEAE-hexylacrylate, DEAE-acrylamide, DEAE-albumin and DEAE-dextran, polymethylacrylate, polyhexylacrylate, poly(D,L-lactic acid), poly(DL-lactic-co-glycolic acid (PLGA), alginate, and polyethyleneglycol (PEG). Oral formulations for oligonucleotides and their preparation are described in detail in U.S. applications Ser. No. 08/886,829 (filed Jul. 1, 1997), Ser. No. 09/108,673 (filed Jul. 1, 1998), Ser. No. 09/256,515 (filed Feb. 23, 1999), Ser. No. 09/082,624 (filed May 21, 1998) and Ser. No. 09/315,298 (filed May 20, 1999) each of which is incorporated herein by reference in their entirety.

[0063] Compositions and formulations for parenteral, intrathecal or intraventricular administration may include sterile aqueous solutions which may also contain buffers, diluents and other suitable additives such as, but not limited to, penetration enhancers, carrier compounds and other pharmaceutically acceptable carriers or excipients.

[0064] Pharmaceutical compositions of the present invention include, but are not limited to, solutions, emulsions, and liposome-containing formulations. These compositions may be generated from a variety of components that include, but are not limited to, preformed liquids, self-emulsifying solids and self-emulsifying semisolids.

[0065] The pharmaceutical formulations of the present invention, which may conveniently be presented in unit dosage form, may be prepared according to conventional techniques well known in the pharmaceutical industry. Such techniques include the step of bringing into association the active ingredients with the pharmaceutical carrier(s) or excipient(s). In general the formulations are prepared by uniformly and intimately bringing into association the active ingredients with liquid carriers or finely divided solid carriers or both, and then, if necessary, shaping the product.

[0066] The compositions of the present invention may be formulated into any of many possible dosage forms such as, but not limited to, tablets, capsules, gel capsules, liquid syrups, soft gels, suppositories, and enemas. The compositions of the present invention may also be formulated as suspensions in aqueous, non-aqueous or mixed media. Aqueous suspensions may further contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers.

[0067] In one embodiment of the present invention the pharmaceutical compositions may be formulated and used as foams. Pharmaceutical foams include formulations such as, but not limited to, emulsions, microemulsions, creams, jellies and liposomes. While basically similar in nature these formulations vary in the components and the consistency of the final product. The preparation of such compositions and formulations is generally known to those skilled in the pharmaceutical and formulation arts and may be applied to the formulation of the compositions of the present invention.

Emulsions

[0068] The compositions of the present invention may be prepared and formulated as emulsions. Emulsions are typically heterogenous systems of one liquid dispersed in another in the form of droplets usually exceeding 0.1 μm in diameter. (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199; Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., Volume 1, p. 245; Block in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 2, p. 335; Higuchi et al., in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 1985, p. 301). Emulsions are often biphasic systems comprising of two immiscible liquid phases intimately mixed and dispersed with each other. In general, emulsions may be either water-in-oil (w/o) or of the oil-in-water (o/w) variety. When an aqueous phase is finely divided into and dispersed as minute droplets into a bulk oily phase the resulting composition is called a water-in-oil (w/o) emulsion. Alternatively, when an oily phase is finely divided into and dispersed as minute droplets into a bulk aqueous phase the resulting composition is called an oil-in-water (o/w) emulsion. Emulsions may contain additional components in addition to the dispersed phases and the active drug which may be present as a solution in either the aqueous phase, oily phase or itself as a separate phase. Pharmaceutical excipients such as emulsifiers, stabilizers, dyes, and anti-oxidants may also be present in emulsions as needed. Pharmaceutical emulsions may also be multiple emulsions that are comprised of more than two phases such as, for example, in the case of oil-in-water-in-oil (o/w/o) and water-in-oil-in-water (w/o/w) emulsions. Such complex formulations often provide certain advantages that simple binary emulsions do not. Multiple emulsions in which individual oil droplets of an o/w emulsion enclose small water droplets constitute a w/o/w emulsion. Likewise a system of oil droplets enclosed in globules of water stabilized in an oily continuous provides an o/w/o emulsion.

[0069] Emulsions are characterized by little or no thermodynamic stability. Often, the dispersed or discontinuous phase of the emulsion is well dispersed into the external or continuous phase and maintained in this form through the means of emulsifiers or the viscosity of the formulation. Either of the phases of the emulsion may be a semisolid or a solid, as is the case of emulsion-style ointment bases and creams. Other means of stabilizing emulsions entail the use of emulsifiers that may be incorporated into either phase of the emulsion. Emulsifiers may broadly be classified into four categories: synthetic surfactants, naturally occurring emulsifiers, absorption bases, and finely dispersed solids (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199).

[0070] Synthetic surfactants, also known as surface active agents, have found wide applicability in the formulation of emulsions and have been reviewed in the literature (Rieger, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 285; Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), Marcel Dekker, Inc., New York, N.Y., 1988, volume 1, p. 199). Surfactants are typically amphiphilic and comprise a hydrophilic and a hydrophobic portion. The ratio of the hydrophilic to the hydrophobic nature of the surfactant has been termed the hydrophile/lipophile balance (HLB) and is a valuable tool in categorizing and selecting surfactants in the preparation of formulations. Surfactants may be classified into different classes based on the nature of the hydrophilic group: nonionic, anionic, cationic and amphoteric (Rieger, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 285).

[0071] Naturally occurring emulsifiers used in emulsion formulations include lanolin, beeswax, phosphatides, lecithin and acacia. Absorption bases possess hydrophilic properties such that they can soak up water to form w/o emulsions yet retain their semisolid consistencies, such as anhydrous lanolin and hydrophilic petrolatum. Finely divided solids have also been used as good emulsifiers especially in combination with surfactants and in viscous preparations. These include polar inorganic solids, such as heavy metal hydroxides, nonswelling clays such as bentonite, attapulgite, hectorite, kaolin, montmorillonite, colloidal aluminum silicate and colloidal magnesium aluminum silicate, pigments and nonpolar solids such as carbon or glyceryl tristearate.

[0072] A large variety of non-emulsifying materials are also included in emulsion formulations and contribute to the properties of emulsions. These include fats, oils, waxes, fatty acids, fatty alcohols, fatty esters, humectants, hydrophilic colloids, preservatives and antioxidants (Block, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 335; Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199).

[0073] Hydrophilic colloids or hydrocolloids include naturally occurring gums and synthetic polymers such as polysaccharides (for example, acacia, agar, alginic acid, carrageenan, guar gum, karaya gum, and tragacanth), cellulose derivatives (for example, carboxymethylcellulose and carboxypropylcellulose), and synthetic polymers (for example, carbomers, cellulose ethers, and carboxyvinyl polymers). These disperse or swell in water to form colloidal solutions that stabilize emulsions by forming strong interfacial films around the dispersed phase droplets and by increasing the viscosity of the external phase.

[0074] Since emulsions often contain a number of ingredients such as carbohydrates, proteins, sterols and phosphatides that may readily support the growth of microbes, these formulations often incorporate preservatives. Commonly used preservatives included in emulsion formulations include methyl paraben, propyl paraben, quaternary ammonium salts, benzalkonium chloride, esters of p-hydroxybenzoic acid, and boric acid. Antioxidants are also commonly added to emulsion formulations to prevent deterioration of the formulation. Antioxidants used may be free radical scavengers such as tocopherols, alkyl gallates, butylated hydroxyanisole, butylated hydroxytoluene, or reducing agents such as ascorbic acid and sodium metabisulfite, and antioxidant synergists such as citric acid, tartaric acid, and lecithin.

[0075] The application of emulsion formulations via dermatological, oral and parenteral routes and methods for their manufacture have been reviewed in the literature (Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199). Emulsion formulations for oral delivery have been very widely used because of reasons of ease of formulation, efficacy from an absorption and bioavailability standpoint. (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245; Idson, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 199). Mineral-oil base laxatives, oil-soluble vitamins and high fat nutritive preparations are among the materials that have commonly been administered orally as o/w emulsions.

[0076] In one embodiment of the present invention, the compositions of oligonucleotides and nucleic acids are formulated as microemulsions. A microemulsion may be defined as a system of water, oil and amphiphile which is a single optically isotropic and thermodynamically stable liquid solution (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245). Typically microemulsions are systems that are prepared by first dispersing an oil in an aqueous surfactant solution and then adding a sufficient amount of a fourth component, generally an intermediate chain-length alcohol to form a transparent system. Therefore, microemulsions have also been described as thermodynamically stable, isotropically clear dispersions of two immiscible liquids that are stabilized by interfacial films of surface-active molecules (Leung and Shah, in: Controlled Release of Drugs: Polymers and Aggregate Systems, Rosoff, M., Ed., 1989, VCH Publishers, New York, pages 185-215). Microemulsions commonly are prepared via a combination of three to five components that include oil, water, surfactant, cosurfactant and electrolyte. Whether the microemulsion is of the water-in-oil (w/o) or an oil-in-water (o/w) type is dependent on the properties of the oil and surfactant used and on the structure and geometric packing of the polar heads and hydrocarbon tails of the surfactant molecules (Schott, in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 1985, p. 271).

[0077] The phenomenological approach utilizing phase diagrams has been extensively studied and has yielded a comprehensive knowledge, to one skilled in the art, of how to formulate microemulsions (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245; Block, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 335). Compared to conventional emulsions, microemulsions offer the advantage of solubilizing water-insoluble drugs in a formulation of thermodynamically stable droplets that are formed spontaneously.

[0078] Surfactants used in the preparation of microemulsions include, but are not limited to, ionic surfactants, non-ionic surfactants, Brij 96, polyoxyethylene oleyl ethers, polyglycerol fatty acid esters, tetraglycerol monolaurate (ML310), tetraglycerol monooleate (MO310), hexaglycerol monooleate (PO310), hexaglycerol pentaoleate (PO500), decaglycerol monocaprate (MCA750), decaglycerol monooleate (MO750), decaglycerol sequioleate (SO750), decaglycerol decaoleate (DAO750), alone or in combination with cosurfactants. The cosurfactant, usually a short-chain alcohol such as ethanol, 1-propanol, and 1-butanol, serves to increase the interfacial fluidity by penetrating into the surfactant film and consequently creating a disordered film because of the void space generated among surfactant molecules. Microemulsions may, however, be prepared without the use of cosurfactants and alcohol-free self-emulsifying microemulsion systems are known in the art. The aqueous phase may typically be, but is not limited to, water, an aqueous solution of the drug, glycerol, PEG300, PEG400, polyglycerols, propylene glycols, and derivatives of ethylene glycol. The oil phase may include, but is not limited to, materials such as Captex 300, Captex 355, Capmul MCM, fatty acid esters, medium chain (C8-C12) mono, di, and tri-glycerides, polyoxyethylated glyceryl fatty acid esters, fatty alcohols, polyglycolized glycerides, saturated polyglycolized C8-C10 glycerides, vegetable oils and silicone oil.

[0079] Microemulsions are particularly of interest from the standpoint of drug solubilization and the enhanced absorption of drugs. Lipid based microemulsions (both o/w and w/o) have been proposed to enhance the oral bioavailability of drugs, including peptides (Constantinides et al., Pharmaceutical Research, 1994, 11, 1385-1390; Ritschel, Meth. Find. Exp. Clin. Pharmacol., 1993, 13, 205). Microemulsions afford advantages of improved drug solubilization, protection of drug from enzymatic hydrolysis, possible enhancement of drug absorption due to surfactant-induced alterations in membrane fluidity and permeability, ease of preparation, ease of oral administration over solid dosage forms, improved clinical potency, and decreased toxicity (Constantinides et al., Pharmaceutical Research, 1994, 11, 1385; Ho et al., J. Pharm. Sci., 1996, 85, 138-143). Often microemulsions may form spontaneously when their components are brought together at ambient temperature. This may be particularly advantageous when formulating thermolabile drugs, peptides or oligonucleotides. Microemulsions have also been effective in the transdermal delivery of active components in both cosmetic and pharmaceutical applications. It is expected that the microemulsion compositions and formulations of the present invention will facilitate the increased systemic absorption of oligonucleotides and nucleic acids from the gastrointestinal tract, as well as improve the local cellular uptake of oligonucleotides and nucleic acids within the gastrointestinal tract, vagina, buccal cavity and other areas of administration.

[0080] Microemulsions of the present invention may also contain additional components and additives such as sorbitan monostearate (Grill 3), Labrasol, and penetration enhancers to improve the properties of the formulation and to enhance the absorption of the oligonucleotides and nucleic acids of the present invention. Penetration enhancers used in the microemulsions of the present invention may be classified as belonging to one of five broad categories—surfactants, fatty acids, bile salts, chelating agents, and non-chelating non-surfactants (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p. 92). Each of these classes has been discussed above.

Liposomes

[0081] There are many organized surfactant structures besides microemulsions that have been studied and used for the formulation of drugs. These include monolayers, micelles, bilayers and vesicles. Vesicles, such as liposomes, have attracted great interest because of their specificity and the duration of action they offer from the standpoint of drug delivery. As used in the present invention, the term “liposome” means a vesicle composed of amphiphilic lipids arranged in a spherical bilayer or bilayers.

[0082] Liposomes are unilamellar or multilamellar vesicles which have a membrane formed from a lipophilic material and an aqueous interior. The aqueous portion contains the composition to be delivered. Cationic liposomes possess the advantage of being able to fuse to the cell wall. Non-cationic liposomes, although not able to fuse as efficiently with the cell wall, are taken up by macrophages in vivo.

[0083] In order to cross intact mammalian skin, lipid vesicles must pass through a series of fine pores, each with a diameter less than 50 nm, under the influence of a suitable transdermal gradient. Therefore, it is desirable to use a liposome which is highly deformable and able to pass through such fine pores.

[0084] Further advantages of liposomes include; liposomes obtained from natural phospholipids are biocompatible and biodegradable; liposomes can incorporate a wide range of water and lipid soluble drugs; liposomes can protect encapsulated drugs in their internal compartments from metabolism and degradation (Rosoff, in Pharmaceutical Dosage Forms, Lieberman, Rieger and Banker (Eds.), 1988, Marcel Dekker, Inc., New York, N.Y., volume 1, p. 245). Important considerations in the preparation of liposome formulations are the lipid surface charge, vesicle size and the aqueous volume of the liposomes.

[0085] Liposomes are useful for the transfer and delivery of active ingredients to the site of action. Because the liposomal membrane is structurally similar to biological membranes, when liposomes are applied to a tissue, the liposomes start to merge with the cellular membranes. As the merging of the liposome and cell progresses, the liposomal contents are emptied into the cell where the active agent may act.

[0086] Liposomal formulations have been the focus of extensive investigation as the mode of delivery for many drugs. There is growing evidence that for topical administration, liposomes present several advantages over other formulations. Such advantages include reduced side-effects related to high systemic absorption of the administered drug, increased accumulation of the administered drug at the desired target, and the ability to administer a wide variety of drugs, both hydrophilic and hydrophobic, into the skin.

[0087] Several reports have detailed the ability of liposomes to deliver agents including high-molecular weight DNA into the skin. Compounds including analgesics, antibodies, hormones and high-molecular weight DNAs have been administered to the skin. The majority of applications resulted in the targeting of the upper epidermis.

[0088] Liposomes fall into two broad classes. Cationic liposomes are positively charged liposomes which interact with the negatively charged DNA molecules to form a stable complex. The positively charged DNA/liposome complex binds to the negatively charged cell surface and is internalized in an endosome. Due to the acidic pH within the endosome, the liposomes are ruptured, releasing their contents into the cell cytoplasm (Wang et al., Biochem. Biophys. Res. Commun., 1987, 147, 980-985).

[0089] Liposomes which are pH-sensitive or negatively-charged, entrap DNA rather than complex with it. Since both the DNA and the lipid are similarly charged, repulsion rather than complex formation occurs. Nevertheless, some DNA is entrapped within the aqueous interior of these liposomes. pH-sensitive liposomes have been used to deliver DNA encoding the thymidine kinase gene to cell monolayers in culture. Expression of the exogenous gene was detected in the target cells (Zhou et al., Journal of Controlled Release, 1992, 19, 269-274).

[0090] One major type of liposomal composition includes phospholipids other than naturally-derived phosphatidylcholine. Neutral liposome compositions, for example, can be formed from dimyristoyl phosphatidylcholine (DMPC) or dipalmitoyl phosphatidylcholine (DPPC). Anionic liposome compositions generally are formed from dimyristoyl phosphatidylglycerol, while anionic fusogenic liposomes are formed primarily from dioleoyl phosphatidylethanolamine (DOPE). Another type of liposomal composition is formed from phosphatidylcholine (PC) such as, for example, soybean PC, and egg PC. Another type is formed from mixtures of phospholipid and/or phosphatidylcholine and/or cholesterol.

[0091] Several studies have assessed the topical delivery of liposomal drug formulations to the skin. Application of liposomes containing interferon to guinea pig skin resulted in a reduction of skin herpes sores while delivery of interferon via other means (e.g. as a solution or as an emulsion) were ineffective (Weiner et al., Journal of Drug Targeting, 1992, 2, 405-410). Further, an additional study tested the efficacy of interferon administered as part of a liposomal formulation to the administration of interferon using an aqueous system, and concluded that the liposomal formulation was superior to aqueous administration (du Plessis et al., Antiviral Research, 1992, 18, 259-265).

[0092] Non-ionic liposomal systems have also been examined to determine their utility in the delivery of drugs to the skin, in particular systems comprising non-ionic surfactant and cholesterol. Non-ionic liposomal formulations comprising Novasome™ I (glyceryl dilaurate/cholesterol/polyoxyethylene-10-stearyl ether) and Novasome™ II (glyceryl distearate/cholesterol/polyoxyethylene-10-stearyl ether) were used to deliver cyclosporin-A into the dermis of mouse skin. Results indicated that such non-ionic liposomal systems were effective in facilitating the deposition of cyclosporin-A into different layers of the skin (Hu et al. S.T.P. Pharma. Sci., 1994, 4, 6, 466).

[0093] Liposomes also include “sterically stabilized” liposomes, a term which, as used herein, refers to liposomes comprising one or more specialized lipids that, when incorporated into liposomes, result in enhanced circulation lifetimes relative to liposomes lacking such specialized lipids. Examples of sterically stabilized liposomes are those in which part of the vesicle-forming lipid portion of the liposome (A) comprises one or more glycolipids, such as monosialoganglioside G_(M1), or (B) is derivatized with one or more hydrophilic polymers, such as a polyethylene glycol (PEG) moiety. While not wishing to be bound by any particular theory, it is thought in the art that, at least for sterically stabilized liposomes containing gangliosides, sphingomyelin, or PEG-derivatized lipids, the enhanced circulation half-life of these sterically stabilized liposomes derives from a reduced uptake into cells of the reticuloendothelial system (RES) (Allen et al., FEBS Letters, 1987, 223, 42; Wu et al., Cancer Research, 1993, 53, 3765).

[0094] Various liposomes comprising one or more glycolipids are known in the art. Papahadjopoulos et al. (Ann. N.Y. Acad. Sci., 1987, 507, 64) reported the ability of monosialoganglioside G_(M1), galactocerebroside sulfate and phosphatidylinositol to improve blood half-lives of liposomes. These findings were expounded upon by Gabizon et al. (Proc. Natl. Acad. Sci. U.S.A., 1988, 85, 6949). U.S. Pat. No. 4,837,028 and WO 88/04924, both to Allen et al., disclose liposomes comprising (1) sphingomyelin and (2) the ganglioside G_(M1) or a galactocerebroside sulfate ester. U.S. Pat. No. 5,543,152 (Webb et al.) discloses liposomes comprising sphingomyelin. Liposomes comprising 1,2-sn-dimyristoylphosphatidylcholine are disclosed in WO 97/13499 (Lim et al.).

[0095] Many liposomes comprising lipids derivatized with one or more hydrophilic polymers, and methods of preparation thereof, are known in the art. Sunamoto et al. (Bull. Chem. Soc. Jpn., 1980, 53, 2778) described liposomes comprising a nonionic detergent, 2C₁₂15G, that contains a PEG moiety. Illum et al. (FEBS Lett., 1984, 167, 79) noted that hydrophilic coating of polystyrene particles with polymeric glycols results in significantly enhanced blood half-lives. Synthetic phospholipids modified by the attachment of carboxylic groups of polyalkylene glycols (e.g., PEG) are described by Sears (U.S. Pat. Nos. 4,426,330 and 4,534,899). Klibanov et al. (FEBS Lett., 1990, 268, 235) described experiments demonstrating that liposomes comprising phosphatidylethanolamine (PE) derivatized with PEG or PEG stearate have significant increases in blood circulation half-lives. Blume et al. (Biochimica et Biophysica Acta, 1990, 1029, 91) extended such observations to other PEG-derivatized phospholipids, e.g., DSPE-PEG, formed from the combination of distearoylphosphatidylethanolamine (DSPE) and PEG. Liposomes having covalently bound PEG moieties on their external surface are described in European Patent No. EP 0 445 131 B1 and WO 90/04384 to Fisher. Liposome compositions containing 1-20 mole percent of PE derivatized with PEG, and methods of use thereof, are described by Woodle et al. (U.S. Pat. Nos. 5,013,556 and 5,356,633) and Martin et al. (U.S. Pat. No. 5,213,804 and European Patent No. EP 0 496 813 B1). Liposomes comprising a number of other lipid-polymer conjugates are disclosed in WO 91/05545 and U.S. Pat. No. 5,225,212 (both to Martin et al.) and in WO 94/20073 (Zalipsky et al.) Liposomes comprising PEG-modified ceramide lipids are described in WO 96/10391 (Choi et al.). U.S. Pat. No. 5,540,935 (Miyazaki et al.) and U.S. Pat. No. 5,556,948 (Tagawa et al.) describe PEG-containing liposomes that can be further derivatized with functional moieties on their surfaces.

[0096] A limited number of liposomes comprising nucleic acids are known in the art. WO 96/40062 to Thierry et al. discloses methods for encapsulating high molecular weight nucleic acids in liposomes. U.S. Pat. No. 5,264,221 to Tagawa et al. discloses protein-bonded liposomes and asserts that the contents of such liposomes may include an antisense RNA. U.S. Pat. No. 5,665,710 to Rahman et al. describes certain methods of encapsulating oligodeoxynucleotides in liposomes. WO 97/04787 to Love et al. discloses liposomes comprising antisense oligonucleotides targeted to the raf gene.

[0097] Transfersomes are yet another type of liposomes, and are highly deformable lipid aggregates which are attractive candidates for drug delivery vehicles. Transfersomes may be described as lipid droplets which are so highly deformable that they are easily able to penetrate through pores which are smaller than the droplet. Transfersomes are adaptable to the environment in which they are used, e.g. they are self-optimizing (adaptive to the shape of pores in the skin), self-repairing, frequently reach their targets without fragmenting, and often self-loading. To make transfersomes it is possible to add surface edge-activators, usually surfactants, to a standard liposomal composition. Transfersomes have been used to deliver serum albumin to the skin. The transfersome-mediated delivery of serum albumin has been shown to be as effective as subcutaneous injection of a solution containing serum albumin.

[0098] Surfactants find wide application in formulations such as emulsions (including microemulsions) and liposomes. The most common way of classifying and ranking the properties of the many different types of surfactants, both natural and synthetic, is by the use of the hydrophile/lipophile balance (HLB). The nature of the hydrophilic group (also known as the “head”) provides the most useful means for categorizing the different surfactants used in formulations (Rieger, in Pharmaceutical Dosage Forms, Marcel Dekker, Inc., New York, N.Y., 1988, p. 285).

[0099] If the surfactant molecule is not ionized, it is classified as a nonionic surfactant. Nonionic surfactants find wide application in pharmaceutical and cosmetic products and are usable over a wide range of pH values. In general their HLB values range from 2 to about 18 depending on their structure. Nonionic surfactants include nonionic esters such as ethylene glycol esters, propylene glycol esters, glyceryl esters, polyglyceryl esters, sorbitan esters, sucrose esters, and ethoxylated esters. Nonionic alkanolamides and ethers such as fatty alcohol ethoxylates, propoxylated alcohols, and ethoxylated/propoxylated block polymers are also included in this class. The polyoxyethylene surfactants are the most popular members of the nonionic surfactant class.

[0100] If the surfactant molecule carries a negative charge when it is dissolved or dispersed in water, the surfactant is classified as anionic. Anionic surfactants include carboxylates such as soaps, acyl lactylates, acyl amides of amino acids, esters of sulfuric acid such as alkyl sulfates and ethoxylated alkyl sulfates, sulfonates such as alkyl benzene sulfonates, acyl isethionates, acyl taurates and sulfosuccinates, and phosphates. The most important members of the anionic surfactant class are the alkyl sulfates and the soaps.

[0101] If the surfactant molecule carries a positive charge when it is dissolved or dispersed in water, the surfactant is classified as cationic. Cationic surfactants include quaternary ammonium salts and ethoxylated amines. The quaternary ammonium salts are the most used members of this class.

[0102] If the surfactant molecule has the ability to carry either a positive or negative charge, the surfactant is classified as amphoteric. Amphoteric surfactants include acrylic acid derivatives, substituted alkylamides, N-alkylbetaines and phosphatides.

[0103] The use of surfactants in drug products, formulations and in emulsions has been reviewed (Rieger, in Pharmaceutical Dosage Forms, Marcel Dekker, Inc., New York, N.Y., 1988, p. 285).

Penetration Enhancers

[0104] In one embodiment, the present invention employs various penetration enhancers to effect the efficient delivery of nucleic acids, particularly oligonucleotides, to the skin of animals. Most drugs are present in solution in both ionized and nonionized forms. However, usually only lipid soluble or lipophilic drugs readily cross cell membranes. It has been discovered that even non-lipophilic drugs may cross cell membranes if the membrane to be crossed is treated with a penetration enhancer. In addition to aiding the diffusion of non-lipophilic drugs across cell membranes, penetration enhancers also enhance the permeability of lipophilic drugs.

[0105] Penetration enhancers may be classified as belonging to one of five broad categories, i.e., surfactants, fatty acids, bile salts, chelating agents, and non-chelating non-surfactants (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p.92). Each of the above mentioned classes of penetration enhancers are described below in greater detail.

[0106] Surfactants: In connection with the present invention, surfactants (or “surface-active agents”) are chemical entities which, when dissolved in an aqueous solution, reduce the surface tension of the solution or the interfacial tension between the aqueous solution and another liquid, with the result that absorption of oligonucleotides through the mucosa is enhanced. In addition to bile salts and fatty acids, these penetration enhancers include, for example, sodium lauryl sulfate, polyoxyethylene-9-lauryl ether and polyoxyethylene-20-cetyl ether) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p.92); and perfluorochemical emulsions, such as FC-43. Takahashi et al., J. Pharm. Pharmacol., 1988, 40, 252).

[0107] Fatty acids: Various fatty acids and their derivatives which act as penetration enhancers include, for example, oleic acid, lauric acid, capric acid (n-decanoic acid), myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein (1-monooleoyl-rac-glycerol), dilaurin, caprylic acid, arachidonic acid, glycerol 1-monocaprate, 1-dodecylazacycloheptan-2-one, acylcarnitines, acylcholines, C₁₋₁₀ alkyl esters thereof (e.g., methyl, isopropyl and t-butyl), and mono- and di-glycerides thereof (i.e., oleate, laurate, caprate, myristate, palmitate, stearate, linoleate, etc.) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, p.92; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; El Hariri et al., J. Pharm. Pharmacol., 1992, 44, 651-654).

[0108] Bile salts: The physiological role of bile includes the facilitation of dispersion and absorption of lipids and fat-soluble vitamins (Brunton, Chapter 38 in: Goodman & Gilman's The Pharmacological Basis of Therapeutics, 9th Ed., Hardman et al. Eds., McGraw-Hill, New York, 1996, pp. 934-935). Various natural bile salts, and their synthetic derivatives, act as penetration enhancers. Thus the term “bile salts” includes any of the naturally occurring components of bile as well as any of their synthetic derivatives. The bile salts of the invention include, for example, cholic acid (or its pharmaceutically acceptable sodium salt, sodium cholate), dehydrocholic acid (sodium dehydrocholate), deoxycholic acid (sodium deoxycholate), glucholic acid (sodium glucholate), glycholic acid (sodium glycocholate), glycodeoxycholic acid (sodium glycodeoxycholate), taurocholic acid (sodium taurocholate), taurodeoxycholic acid (sodium taurodeoxycholate), chenodeoxycholic acid (sodium chenodeoxycholate), ursodeoxycholic acid (UDCA), sodium tauro-24,25-dihydro-fusidate (STDHF), sodium glycodihydrofusidate and polyoxyethylene-9-lauryl ether (POE) (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92; Swinyard, Chapter 39 In: Remington's Pharmaceutical Sciences, 18th Ed., Gennaro, ed., Mack Publishing Co., Easton, Pa., 1990, pages 782-783; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; Yamamoto et al., J. Pharm. Exp. Ther., 1992, 263, 25; Yamashita et al., J. Pharm. Sci., 1990, 79, 579-583).

[0109] Chelating Agents: Chelating agents, as used in connection with the present invention, can be defined as compounds that remove metallic ions from solution by forming complexes therewith, with the result that absorption of oligonucleotides through the mucosa is enhanced. With regards to their use as penetration enhancers in the present invention, chelating agents have the added advantage of also serving as DNase inhibitors, as most characterized DNA nucleases require a divalent metal ion for catalysis and are thus inhibited by chelating agents (Jarrett, J. Chromatogr., 1993, 618, 315-339). Chelating agents of the invention include but are not limited to disodium ethylenediaminetetraacetate (EDTA), citric acid, salicylates (e.g., sodium salicylate, 5-methoxysalicylate and homovanilate), N-acyl derivatives of collagen, laureth-9 and N-amino acyl derivatives of beta-diketones (enamines)(Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92; Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33; Buur et al., J. Control Rel., 1990, 14, 43-51).

[0110] Non-chelating non-surfactants: As used herein, non-chelating non-surfactant penetration enhancing compounds can be defined as compounds that demonstrate insignificant activity as chelating agents or as surfactants but that nonetheless enhance absorption of oligonucleotides through the alimentary mucosa (Muranishi, Critical Reviews in Therapeutic Drug Carrier Systems, 1990, 7, 1-33). This class of penetration enhancers include, for example, unsaturated cyclic ureas, 1-alkyl- and 1-alkenylazacyclo-alkanone derivatives (Lee et al., Critical Reviews in Therapeutic Drug Carrier Systems, 1991, page 92); and non-steroidal anti-inflammatory agents such as diclofenac sodium, indomethacin and phenylbutazone (Yamashita et al., J. Pharm. Pharmacol., 1987, 39, 621-626).

[0111] Agents that enhance uptake of oligonucleotides at the cellular level may also be added to the pharmaceutical and other compositions of the present invention. For example, cationic lipids, such as lipofectin (Junichi et al, U.S. Pat. No. 5,705,188), cationic glycerol derivatives, and polycationic molecules, such as polylysine (Lollo et al., PCT Application WO 97/30731), are also known to enhance the cellular uptake of oligonucleotides.

[0112] Other agents may be utilized to enhance the penetration of the administered nucleic acids, including glycols such as ethylene glycol and propylene glycol, pyrrols such as 2-pyrrol, azones, and terpenes such as limonene and menthone.

Carriers

[0113] Certain compositions of the present invention also incorporate carrier compounds in the formulation. As used herein, “carrier compound” or “carrier” can refer to a nucleic acid, or analog thereof, which is inert (i.e., does not possess biological activity per se) but is recognized as a nucleic acid by in vivo processes that reduce the bioavailability of a nucleic acid having biological activity by, for example, degrading the biologically active nucleic acid or promoting its removal from circulation. The coadministration of a nucleic acid and a carrier compound, typically with an excess of the latter substance, can result in a substantial reduction of the amount of nucleic acid recovered in the liver, kidney or other extracirculatory reservoirs, presumably due to competition between the carrier compound and the nucleic acid for a common receptor. For example, the recovery of a partially phosphorothioate oligonucleotide in hepatic tissue can be reduced when it is coadministered with polyinosinic acid, dextran sulfate, polycytidic acid or 4-acetamido-4′isothiocyano-stilbene-2,2′-disulfonic acid (Miyao et al., Antisense Res. Dev., 1995, 5, 115-121; Takakura et al., Antisense & Nucl. Acid Drug Dev., 1996, 6, 177-183).

Excipients

[0114] In contrast to a carrier compound, a “pharmaceutical carrier” or “excipient” is a pharmaceutically acceptable solvent, suspending agent or any other pharmacologically inert vehicle for delivering one or more nucleic acids to an animal. The excipient may be liquid or solid and is selected, with the planned manner of administration in mind, so as to provide for the desired bulk, consistency, etc., when combined with a nucleic acid and the other components of a given pharmaceutical composition. Typical pharmaceutical carriers include, but are not limited to, binding agents (e.g., pregelatinized maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose, etc.); fillers (e.g., lactose and other sugars, microcrystalline cellulose, pectin, gelatin, calcium sulfate, ethyl cellulose, polyacrylates or calcium hydrogen phosphate, etc.); lubricants (e.g., magnesium stearate, talc, silica, colloidal silicon dioxide, stearic acid, metallic stearates, hydrogenated vegetable oils, corn starch, polyethylene glycols, sodium benzoate, sodium acetate, etc.); disintegrants (e.g., starch, sodium starch glycolate, etc.); and wetting agents (e.g., sodium lauryl sulphate, etc.).

[0115] Pharmaceutically acceptable organic or inorganic excipient suitable for non-parenteral administration which do not deleteriously react with nucleic acids can also be used to formulate the compositions of the present invention. Suitable pharmaceutically acceptable carriers include, but are not limited to, water, salt solutions, alcohols, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.

[0116] Formulations for topical administration of nucleic acids may include sterile and non-sterile aqueous solutions, non-aqueous solutions in common solvents such as alcohols, or solutions of the nucleic acids in liquid or solid oil bases. The solutions may also contain buffers, diluents and other suitable additives. Pharmaceutically acceptable organic or inorganic excipients suitable for non-parenteral administration which do not deleteriously react with nucleic acids can be used.

[0117] Suitable pharmaceutically acceptable excipients include, but are not limited to, water, salt solutions, alcohol, polyethylene glycols, gelatin, lactose, amylose, magnesium stearate, talc, silicic acid, viscous paraffin, hydroxymethylcellulose, polyvinylpyrrolidone and the like.

Other Components

[0118] The compositions of the present invention may additionally contain other adjunct components conventionally found in pharmaceutical compositions, at their art-established usage levels. Thus, for example, the compositions may contain additional, compatible, pharmaceutically-active materials such as, for example, antipruritics, astringents, local anesthetics or anti-inflammatory agents, or may contain additional materials useful in physically formulating various dosage forms of the compositions of the present invention, such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, thickening agents and stabilizers. However, such materials, when added, should not unduly interfere with the biological activities of the components of the compositions of the present invention. The formulations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic substances and the like which do not deleteriously interact with the nucleic acid(s) of the formulation.

[0119] Aqueous suspensions may contain substances which increase the viscosity of the suspension including, for example, sodium carboxymethylcellulose, sorbitol and/or dextran. The suspension may also contain stabilizers.

[0120] Certain embodiments of the invention provide pharmaceutical compositions containing (a) one or more antisense compounds and (b) one or more other chemotherapeutic agents which function by a non-antisense mechanism. Examples of such chemotherapeutic agents include but are not limited to daunorubicin, daunomycin, dactinomycin, doxorubicin, epirubicin, idarubicin, esorubicin, bleomycin, mafosfamide, ifosfamide, cytosine arabinoside, bis-chloroethylnitrosurea, busulfan, mitomycin C, actinomycin D, mithramycin, prednisone, hydroxyprogesterone, testosterone, tamoxifen, dacarbazine, procarbazine, hexamethylmelamine, pentamethylmelamine, mitoxantrone, amsacrine, chlorambucil, methylcyclohexylnitrosurea, nitrogen mustards, melphalan, cyclophosphamide, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-azacytidine, hydroxyurea, deoxycoformycin, 4-hydroxyperoxycyclophosphoramide, 5-fluorouracil (5-FU), 5-fluorodeoxyuridine (5-FUdR), methotrexate (MTX), colchicine, taxol, vincristine, vinblastine, etoposide (VP-16), trimetrexate, irinotecan, topotecan, gemcitabine, teniposide, cisplatin and diethylstilbestrol (DES). See, generally, The Merck Manual of Diagnosis and Therapy, 15th Ed. 1987, pp. 1206-1228, Berkow et al., eds., Rahway, N.J. When used with the compounds of the invention, such chemotherapeutic agents may be used individually (e.g., 5-FU and oligonucleotide), sequentially (e.g., 5-FU and oligonucleotide for a period of time followed by MTX and oligonucleotide), or in combination with one or more other such chemotherapeutic agents (e.g., 5-FU, MTX and oligonucleotide, or 5-FU, radiotherapy and oligonucleotide). Anti-inflammatory drugs, including but not limited to nonsteroidal anti-inflammatory drugs and corticosteroids, and antiviral drugs, including but not limited to ribivirin, vidarabine, acyclovir and ganciclovir, may also be combined in compositions of the invention. See, generally, The Merck Manual of Diagnosis and Therapy, 15th Ed., Berkow et al., eds., 1987, Rahway, N.J., pages 2499-2506 and 46-49, respectively). Other non-antisense chemotherapeutic agents are also within the scope of this invention. Two or more combined compounds may be used together or sequentially.

[0121] In another related embodiment, compositions of the invention may contain one or more antisense compounds, particularly oligonucleotides, targeted to a first nucleic acid and one or more additional antisense compounds targeted to a second nucleic acid target. Numerous examples of antisense compounds are known in the art. Two or more combined compounds may be used together or sequentially.

[0122] The formulation of therapeutic compositions and their subsequent administration is believed to be within the skill of those in the art. Dosing is dependent on severity and responsiveness of the disease state to be treated, with the course of treatment lasting from several days to several months, or until a cure is effected or a diminution of the disease state is achieved. Optimal dosing schedules can be calculated from measurements of drug accumulation in the body of the patient. Persons of ordinary skill can easily determine optimum dosages, dosing methodologies and repetition rates. Optimum dosages may vary depending on the relative potency of individual oligonucleotides, and can generally be estimated based on EC₅₀s found to be effective in in vitro and in vivo animal models. In general, dosage is from 0.01 ug to 100 g per kg of body weight, and may be given once or more daily, weekly, monthly or yearly, or even once every 2 to 20 years. Persons of ordinary skill in the art can easily estimate repetition rates for dosing based on measured residence times and concentrations of the drug in bodily fluids or tissues. Following successful treatment, it may be desirable to have the patient undergo maintenance therapy to prevent the recurrence of the disease state, wherein the oligonucleotide is administered in maintenance doses, ranging from 0.01 ug to 100 g per kg of body weight, once or more daily, to once every 20 years.

[0123] While the present invention has been described with specificity in accordance with certain of its preferred embodiments, the following examples serve only to illustrate the invention and are not intended to limit the same.

EXAMPLES Example 1 Nucleoside Phosphoramidites for Oligonucleotide Synthesis Deoxy and 2′-alkoxy amidites

[0124] 2′-Deoxy and 2′-methoxy beta-cyanoethyldiisopropyl phosphoramidites were purchased from commercial sources (e.g. Chemgenes, Needham Mass. or Glen Research, Inc. Sterling Va.). Other 2′-O-alkoxy substituted nucleoside amidites are prepared as described in U.S. Pat. No. 5,506,351, herein incorporated by reference. For oligonucleotides synthesized using 2′-alkoxy amidites, the standard cycle for unmodified oligonucleotides was utilized, except the wait step after pulse delivery of tetrazole and base was increased to 360 seconds.

[0125] Oligonucleotides containing 5-methyl-2′-deoxycytidine (5-Me-C) nucleotides were synthesized according to published methods [Sanghvi, et. al., Nucleic Acids Research, 1993, 21, 3197-3203] using commercially available phosphoramidites (Glen Research, Sterling Va. or ChemGenes, Needham Mass.).

2′-Fluoro amidites 2′-Fluorodeoxyadenosine amidites

[0126] 2′-fluoro oligonucleotides were synthesized as described previously [Kawasaki, et. al., J. Med. Chem., 1993, 36, 831-841] and U.S. Pat. No. 5,670,633, herein incorporated by reference. Briefly, the protected nucleoside N6-benzoyl-2′-deoxy-2′-fluoroadenosine was synthesized utilizing commercially available 9-beta-D-arabinofuranosyladenine as starting material and by modifying literature procedures whereby the 2′-alpha-fluoro atom is introduced by a S_(N)2-displacement of a 2′-beta-trityl group. Thus N6-benzoyl-9-beta-D-arabinofuranosyladenine was selectively protected in moderate yield as the 3′,5′-ditetrahydropyranyl (THP) intermediate. Deprotection of the THP and N6-benzoyl groups was accomplished using standard methodologies and standard methods were used to obtain the 5′-dimethoxytrityl-(DMT) and 5′-DMT-3′-phosphoramidite intermediates.

2′-Fluorodeoxyguanosine

[0127] The synthesis of 2′-deoxy-2′-fluoroguanosine was accomplished using tetraisopropyldisiloxanyl (TPDS) protected 9-beta-D-arabinofuranosylguanine as starting material, and conversion to the intermediate diisobutyrylarabinofuranosylguanosine. Deprotection of the TPDS group was followed by protection of the hydroxyl group with THP to give diisobutyryl di-THP protected arabinofuranosylguanine. Selective O-deacylation and triflation was followed by treatment of the crude product with fluoride, then deprotection of the THP groups. Standard methodologies were used to obtain the 5′-DMT- and 5′-DMT-3′-phosphoramidites.

2′-Fluorouridine

[0128] Synthesis of 2′-deoxy-2′-fluorouridine was accomplished by the modification of a literature procedure in which 2,2′-anhydro-1-beta-D-arabinofuranosyluracil was treated with 70% hydrogen fluoride-pyridine. Standard procedures were used to obtain the 5′-DMT and 5′-DMT-3′ phosphoramidites.

2′-Fluorodeoxycytidine

[0129] 2′-deoxy-2′-fluorocytidine was synthesized via amination of 2′-deoxy-2′-fluorouridine, followed by selective protection to give N4-benzoyl-2′-deoxy-2′-fluorocytidine. Standard procedures were used to obtain the 5′-DMT and 5′-DMT-3′ phosphoramidites.

2′-O-(2-Methoxyethyl) modified amidites

[0130] 2′-O-Methoxyethyl-substituted nucleoside amidites are prepared as follows, or alternatively, as per the methods of Martin, P., Helvetica Chimica Acta, 1995, 78, 486-504.

2,2′-Anhydro[1-(beta-D-arabinofuranosyl)-5-methyluridine]

[0131] 5-Methyluridine (ribosylthymine, commercially available through Yamasa, Choshi, Japan) (72.0 g, 0.279 M), diphenylcarbonate (90.0 g, 0.420 M) and sodium bicarbonate (2.0 g, 0.024 M) were added to DMF (300 mL). The mixture was heated to reflux, with stirring, allowing the evolved carbon dioxide gas to be released in a controlled manner. After 1 hour, the slightly darkened solution was concentrated under reduced pressure. The resulting syrup was poured into diethylether (2.5 L), with stirring. The product formed a gum. The ether was decanted and the residue was dissolved in a minimum amount of methanol (ca. 400 mL). The solution was poured into fresh ether (2.5 L) to yield a stiff gum. The ether was decanted and the gum was dried in a vacuum oven (60° C. at 1 mm Hg for 24 h) to give a solid that was crushed to a light tan powder (57 g, 85% crude yield). The NMR spectrum was consistent with the structure, contaminated with phenol as its sodium salt (ca. 5%). The material was used as is for further reactions (or it can be purified further by column chromatography using a gradient of methanol in ethyl acetate (10-25%) to give a white solid, mp 222-4° C.).

2′-O-Methoxyethyl-5-methyluridine

[0132] 2,2′-Anhydro-5-methyluridine (195 g, 0.81 M), tris(2-methoxyethyl)borate (231 g, 0.98 M) and 2-methoxyethanol (1.2 L) were added to a 2 L stainless steel pressure vessel and placed in a pre-heated oil bath at 160° C. After heating for 48 hours at 155-160° C., the vessel was opened and the solution evaporated to dryness and triturated with MeOH (200 mL). The residue was suspended in hot acetone (1 L). The insoluble salts were filtered, washed with acetone (150 mL) and the filtrate evaporated. The residue (280 g) was dissolved in CH₃CN (600 mL) and evaporated. A silica gel column (3 kg) was packed in CH₂Cl₂/acetone/MeOH (20:5:3) containing 0.5% Et₃NH. The residue was dissolved in CH₂Cl₂ (250 mL) and adsorbed onto silica (150 g) prior to loading onto the column. The product was eluted with the packing solvent to give 160 g (63%) of product. Additional material was obtained by reworking impure fractions.

2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine

[0133] 2′-O-Methoxyethyl-5-methyluridine (160 g, 0.506 M) was co-evaporated with pyridine (250 mL) and the dried residue dissolved in pyridine (1.3 L). A first aliquot of dimethoxytrityl chloride (94.3 g, 0.278 M) was added and the mixture stirred at room temperature for one hour. A second aliquot of dimethoxytrityl chloride (94.3 g, 0.278 M) was added and the reaction stirred for an additional one hour. Methanol (170 mL) was then added to stop the reaction. HPLC showed the presence of approximately 70% product. The solvent was evaporated and triturated with CH₃CN (200 mL). The residue was dissolved in CHCl₃ (1.5 L) and extracted with 2×500 mL of saturated NaHCO₃ and 2×500 mL of saturated NaCl. The organic phase was dried over Na₂SO₄, filtered and evaporated. 275 g of residue was obtained. The residue was purified on a 3.5 kg silica gel column, packed and eluted with EtOAc/hexane/acetone (5:5:1) containing 0.5% Et₃NH. The pure fractions were evaporated to give 164 g of product. Approximately 20 g additional was obtained from the impure fractions to give a total yield of 183 g (57%).

3′-O-Acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine

[0134] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine (106 g, 0.167 M), DMF/pyridine (750 mL of a 3:1 mixture prepared from 562 mL of DMF and 188 mL of pyridine) and acetic anhydride (24.38 mL, 0.258 M) were combined and stirred at room temperature for 24 hours. The reaction was monitored by TLC by first quenching the TLC sample with the addition of MeOH. Upon completion of the reaction, as judged by TLC, MeOH (50 mL) was added and the mixture evaporated at 35° C. The residue was dissolved in CHCl₃ (800 mL) and extracted with 2×200 mL of saturated sodium bicarbonate and 2×200 mL of saturated NaCl. The water layers were back extracted with 200 mL of CHCl₃. The combined organics were dried with sodium sulfate and evaporated to give 122 g of residue (approx. 90% product). The residue was purified on a 3.5 kg silica gel column and eluted using EtOAc/hexane(4:1). Pure product fractions were evaporated to yield 96 g (84%). An additional 1.5 g was recovered from later fractions.

3′-O-Acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyl-4-triazoleuridine

[0135] A first solution was prepared by dissolving 3′-O-acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyluridine (96 g, 0.144 M) in CH₃CN (700 mL) and set aside. Triethylamine (189 mL, 1.44 M) was added to a solution of triazole (90 g, 1.3 M) in CH₃CN (1 L), cooled to −5° C. and stirred for 0.5 h using an overhead stirrer. POCl₃ was added dropwise, over a 30 minute period, to the stirred solution maintained at 0-10° C., and the resulting mixture stirred for an additional 2 hours. The first solution was added dropwise, over a 45 minute period, to the latter solution. The resulting reaction mixture was stored overnight in a cold room. Salts were filtered from the reaction mixture and the solution was evaporated. The residue was dissolved in EtOAc (1 L) and the insoluble solids were removed by filtration. The filtrate was washed with 1×300 mL of NaHCO₃ and 2×300 mL of saturated NaCl, dried over sodium sulfate and evaporated. The residue was triturated with EtOAc to give the title compound.

2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine

[0136] A solution of 3′-O-acetyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methyl-4-triazoleuridine (103 g, 0.141 M) in dioxane (500 mL) and NH₄OH (30 mL) was stirred at room temperature for 2 hours. The dioxane solution was evaporated and the residue azeotroped with MeOH (2×200 mL). The residue was dissolved in MeOH (300 mL) and transferred to a 2 liter stainless steel pressure vessel. MeOH (400 mL) saturated with NH₃ gas was added and the vessel heated to 100° C. for 2 hours (TLC showed complete conversion). The vessel contents were evaporated to dryness and the residue was dissolved in EtOAc (500 mL) and washed once with saturated NaCl (200 mL). The organics were dried over sodium sulfate and the solvent was evaporated to give 85 g (95%) of the title compound.

N4-Benzoyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine

[0137] 2′-O-Methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine (85 g, 0.134 M) was dissolved in DMF (800 mL) and benzoic anhydride (37.2 g, 0.165 M) was added with stirring. After stirring for 3 hours, TLC showed the reaction to be approximately 95% complete. The solvent was evaporated and the residue azeotroped with MeOH (200 mL). The residue was dissolved in CHCl₃ (700 mL) and extracted with saturated NaHCO₃ (2×300 mL) and saturated NaCl (2×300 mL), dried over MgSO₄ and evaporated to give a residue (96 g). The residue was chromatographed on a 1.5 kg silica column using EtOAc/hexane (1:1) containing 0.5% Et₃NH as the eluting solvent. The pure product fractions were evaporated to give 90 g (90%) of the title compound.

N4-Benzoyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine-3′-amidite

[0138] N4-Benzoyl-2′-O-methoxyethyl-5′-O-dimethoxytrityl-5-methylcytidine (74 g, 0.10 M) was dissolved in CH₂Cl₂ (1 L). Tetrazole diisopropylamine (7.1 g) and 2-cyanoethoxy-tetra-(isopropyl)phosphite (40.5 mL, 0.123 M) were added with stirring, under a nitrogen atmosphere. The resulting mixture was stirred for 20 hours at room temperature (TLC showed the reaction to be 95% complete) The reaction mixture was extracted with saturated NaHCO₃ (1×300 mL) and saturated NaCl (3×300 mL). The aqueous washes were back-extracted with CH₂Cl₂ (300 mL), and the extracts were combined, dried over MgSO₄ and concentrated. The residue obtained was chromatographed on a 1.5 kg silica column using EtOAc/hexane (3:1) as the eluting solvent. The pure fractions were combined to give 90.6 g (87%) of the title compound.

2′-O-(Aminooxyethyl) nucleoside amidites and 2′-O-(dimethylaminooxyethyl) nucleoside amidites 2′-(Dimethylaminooxyethoxy) nucleoside amidites

[0139] 2′-(Dimethylaminooxyethoxy) nucleoside amidites [also known in the art as 2′-O-(dimethylaminooxyethyl) nucleoside amidites] are prepared as described in the following paragraphs. Adenosine, cytidine and guanosine nucleoside amidites are prepared similarly to the thymidine (5-methyluridine) except the exocyclic amines are protected with a benzoyl moiety in the case of adenosine and cytidine and with isobutyryl in the case of guanosine.

5′-O-tert-Butyldiphenylsilyl-O²-2′-anhydro-5-methyluridine

[0140] O²-2′-anhydro-5-methyluridine (Pro. Bio. Sint., Varese, Italy, 100.0 g, 0.416 mmol), dimethylaminopyridine (0.66 g, 0.013 eq, 0.0054 mmol) were dissolved in dry pyridine (500 ml) at ambient temperature under an argon atmosphere and with mechanical stirring. tert-Butyldiphenylchlorosilane (125.8 g, 119.0 mL, 1.1 eq, 0.458 mmol) was added in one portion. The reaction was stirred for 16 h at ambient temperature. TLC (Rf 0.22, ethyl acetate) indicated a complete reaction. The solution was concentrated under reduced pressure to a thick oil. This was partitioned between dichloromethane (1 L) and saturated sodium bicarbonate (2×1 L) and brine (1 L). The organic layer was dried over sodium sulfate and concentrated under reduced pressure to a thick oil. The oil was dissolved in a 1:1 mixture of ethyl acetate and ethyl ether (600 mL) and the solution was cooled to −10° C. The resulting crystalline product was collected by filtration, washed with ethyl ether (3×200 mL) and dried (40° C., 1 mm Hg, 24 h) to 149 g (74.8%) of white solid. TLC and NMR were consistent with pure product.

5′-O-tert-Butyldiphenylsilyl-2′-O-(2-hydroxyethyl)-5-methyluridine

[0141] In a 2 L stainless steel, unstirred pressure reactor was added borane in tetrahydrofuran (1.0 M, 2.0 eq, 622 mL). In the fume hood and with manual stirring, ethylene glycol (350 mL, excess) was added cautiously at first until the evolution of hydrogen gas subsided. 5′-O-tert-Butyldiphenylsilyl-O²-2′-anhydro-5-methyluridine (149 g, 0.311 mol) and sodium bicarbonate (0.074 g, 0.003 eq) were added with manual stirring. The reactor was sealed and heated in an oil bath until an internal temperature of 160° C. was reached and then maintained for 16 h (pressure <100 psig). The reaction vessel was cooled to ambient and opened. TLC (Rf 0.67 for desired product and Rf 0.82 for ara-T side product, ethyl acetate) indicated about 70% conversion to the product. In order to avoid additional side product formation, the reaction was stopped, concentrated under reduced pressure (10 to 1 mm Hg) in a warm water bath (40-100° C.) with the more extreme conditions used to remove the ethylene glycol. [Alternatively, once the low boiling solvent is gone, the remaining solution can be partitioned between ethyl acetate and water. The product will be in the organic phase.] The residue was purified by column chromatography (2 kg silica gel, ethyl acetate-hexanes gradient 1:1 to 4:1). The appropriate fractions were combined, stripped and dried to product as a white crisp foam (84 g, 50%), contaminated starting material (17.4 g) and pure reusable starting material 20 g. The yield based on starting material less pure recovered starting material was 58%. TLC and NMR were consistent with 99% pure product.

2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine

[0142] 5′-O-tert-Butyldiphenylsilyl-2′-O-(2-hydroxyethyl)-5-methyluridine (20 g, 36.98 mmol) was mixed with triphenylphosphine (11.63 g, 44.36 mmol) and N-hydroxyphthalimide (7.24 g, 44.36 mmol). It was then dried over P₂O₅ under high vacuum for two days at 40° C. The reaction mixture was flushed with argon and dry THF (369.8 mL, Aldrich, sure seal bottle) was added to get a clear solution. Diethyl-azodicarboxylate (6.98 mL, 44.36 mmol) was added dropwise to the reaction mixture. The rate of addition is maintained such that resulting deep red coloration is just discharged before adding the next drop. After the addition was complete, the reaction was stirred for 4 hrs. By that time TLC showed the completion of the reaction (ethylacetate:hexane, 60:40). The solvent was evaporated in vacuum. Residue obtained was placed on a flash column and eluted with ethyl acetate:hexane (60:40), to get 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine as white foam (21.819 g, 86%).

5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy)ethyl]-5-methyluridine

[0143] 2′-O-([2-phthalimidoxy)ethyl]-5′-t-butyldiphenylsilyl-5-methyluridine (3.1 g, 4.5 mmol) was dissolved in dry CH₂Cl₂ (4.5 mL) and methylhydrazine (300 mL, 4.64 mmol) was added dropwise at −10° C. to 0° C. After 1 h the mixture was filtered, the filtrate was washed with ice cold CH₂Cl₂ and the combined organic phase was washed with water, brine and dried over anhydrous Na₂SO₄. The solution was concentrated to get 2′-O-(aminooxyethyl) thymidine, which was then dissolved in MeOH (67.5 mL). To this formaldehyde (20% aqueous solution, w/w, 1.1 eq.) was added and the resulting mixture was stirred for 1 h. Solvent was removed under vacuum; residue chromatographed to get 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy) ethyl]-5-methyluridine as white foam (1.95 g, 78%).

5′-O-tert-Butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine

[0144] 5′-O-tert-butyldiphenylsilyl-2′-O-[(2-formadoximinooxy)ethyl]-5-methyluridine (1.77 g, 3.12 mmol) was dissolved in a solution of 1M pyridinium p-toluenesulfonate (PPTS) in dry MeOH (30.6 mL). Sodium cyanoborohydride (0.39 g, 6.13 mmol) was added to this solution at 10° C. under inert atmosphere. The reaction mixture was stirred for 10 minutes at 10° C. After that the reaction vessel was removed from the ice bath and stirred at room temperature for 2 h, the reaction monitored by TLC (5% MeOH in CH₂Cl₂). Aqueous NaHCO₃ solution (5%, 10 mL) was added and extracted with ethyl acetate (2×20 mL). Ethyl acetate phase was dried over anhydrous Na₂SO₄, evaporated to dryness. Residue was dissolved in a solution of 1M PPTS in MeOH (30.6 mL). Formaldehyde (20% w/w, 30 mL, 3.37 mmol) was added and the reaction mixture was stirred at room temperature for 10 minutes. Reaction mixture cooled to 10° C. in an ice bath, sodium cyanoborohydride (0.39 g, 6.13mmol) was added and reaction mixture stirred at 10° C. for 10 minutes. After 10 minutes, the reaction mixture was removed from the ice bath and stirred at room temperature for 2 hrs. To the reaction mixture 5% NaHCO₃ (25 mL) solution was added and extracted with ethyl acetate (2×25 mL). Ethyl acetate layer was dried over anhydrous Na₂SO₄ and evaporated to dryness . The residue obtained was purified by flash column chromatography and eluted with 5% MeOH in CH₂Cl₂ to get 5′-O-tert-butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine as a white foam (14.6 g, 80%).

2′-O-(dimethylaminooxyethyl)-5-methyluridine

[0145] Triethylamine trihydrofluoride (3.91 mL, 24.0 mmol) was dissolved in dry THF and triethylamine (1.67 mL, 12 mmol, dry, kept over KOH). This mixture of triethylamine-2HF was then added to 5′-O-tert-butyldiphenylsilyl-2′-O-[N,N-dimethylaminooxyethyl]-5-methyluridine (1.40 g, 2.4 mmol) and stirred at room temperature for 24 hrs. Reaction was monitored by TLC (5% MeOH in CH₂Cl₂). Solvent was removed under vacuum and the residue placed on a flash column and eluted with 10% MeOH in CH₂Cl₂ to get 2′-O-(dimethylaminooxyethyl)-5-methyluridine (766 mg, 92.5%).

5′-O-DMT-2′-O-(dimethylaminooxyethyl)-5-methyluridine

[0146] 2′-O-(dimethylaminooxyethyl)-5-methyluridine (750 mg, 2.17 mmol) was dried over P₂O₅ under high vacuum overnight at 40° C. It was then co-evaporated with anhydrous pyridine (20 mL). The residue obtained was dissolved in pyridine (11 mL) under argon atmosphere. 4-dimethylaminopyridine (26.5 mg, 2.60 mmol), 4,4′-dimethoxytrityl chloride (880 mg, 2.60 mmol) was added to the mixture and the reaction mixture was stirred at room temperature until all of the starting material disappeared. Pyridine was removed under vacuum and the residue chromatographed and eluted with 10% MeOH in CH₂Cl₂ (containing a few drops of pyridine) to get 5′-O-DMT-2′-O-(dimethylamino-oxyethyl)-5-methyluridine (1.13 g, 80%).

5′-O-DMT-2′-O-(2-N,N-dimethylaminooxyethyl)-5-methyluridine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite]

[0147] 5′-O-DMT-2′-O-(dimethylaminooxyethyl)-5-methyluridine (1.08 g, 1.67 mmol) was co-evaporated with toluene (20 mL). To the residue N,N-diisopropylamine tetrazonide (0.29 g, 1.67 mmol) was added and dried over P₂O₅ under high vacuum overnight at 40° C. Then the reaction mixture was dissolved in anhydrous acetonitrile (8.4 mL) and 2-cyanoethyl-N,N,N¹,N¹ tetraisopropylphosphoramidite (2.12 mL, 6.08 mmol) was added. The reaction mixture was stirred at ambient temperature for 4 hrs under inert atmosphere. The progress of the reaction was monitored by TLC (hexane:ethyl acetate 1:1). The solvent was evaporated, then the residue was dissolved in ethyl acetate (70 mL) and washed with 5% aqueous NaHCO₃ (40 mL). Ethyl acetate layer was dried over anhydrous Na₂SO₄ and concentrated. Residue obtained was chromatographed (ethyl acetate as eluent) to get 5′-O-DMT-2′-O-(2-N,N-dimethylaminooxyethyl)-5-methyluridine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite] as a foam (1.04 g, 74.9%).

2′-(Aminooxyethoxy) nucleoside amidites

[0148] 2′-(Aminooxyethoxy) nucleoside amidites [also known in the art as 2′-O-(aminooxyethyl) nucleoside amidites] are prepared as described in the following paragraphs. Adenosine, cytidine and thymidine nucleoside amidites are prepared similarly.

N2-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-ethylacetyl)-5′-O-(4,4′-dimethoxytrityl)guanosine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite]

[0149] The 2′-O-aminooxyethyl guanosine analog may be obtained by selective 2′-O-alkylation of diaminopurine riboside. Multigram quantities of diaminopurine riboside may be purchased from Schering AG (Berlin) to provide 2′-O-(2-ethylacetyl) diaminopurine riboside along with a minor amount of the 3′-O-isomer. 2′-O-(2-ethylacetyl) diaminopurine riboside may be resolved and converted to 2′-O-(2-ethylacetyl)guanosine by treatment with adenosine deaminase. (McGee, D. P. C., Cook, P. D., Guinosso, C. J., WO 94/02501 A1 940203.) Standard protection procedures should afford 2′-O-(2-ethylacetyl)-5′-O-(4,4′-dimethoxytrityl)guanosine and 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-ethylacetyl)-5′-O-(4,4′-dimethoxytrityl)guanosine which may be reduced to provide 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-(2-hydroxyethyl)-5′-O-(4,4′-dimethoxytrityl)guanosine. As before the hydroxyl group may be displaced by N-hydroxyphthalimide via a Mitsunobu reaction, and the protected nucleoside may phosphitylated as usual to yield 2-N-isobutyryl-6-O-diphenylcarbamoyl-2′-O-([2-phthalmidoxy]ethyl)-5′-O-(4,4′-dimethoxytrityl)guanosine-3′-[(2-cyanoethyl)-N,N-diisopropylphosphoramidite].

2′-dimethylaminoethoxyethoxy (2′-DMAEOE) nucleoside amidites

[0150] 2′-dimethylaminoethoxyethoxy nucleoside amidites (also known in the art as 2′-O-dimethylaminoethoxyethyl, i.e., 2′-O—CH₂—O—CH₂—N(CH₂)₂, or 2′-DMAEOE nucleoside amidites) are prepared as follows. Other nucleoside amidites are prepared similarly.

2′-O-[2(2-N,N-dimethylaminoethoxy)ethyl]-5-methyl uridine

[0151] 2[2-(Dimethylamino)ethoxy]ethanol (Aldrich, 6.66 g, 50 mmol) is slowly added to a solution of borane in tetra-hydrofuran (1 M, 10 mL, 10 mmol) with stirring in a 100 mL bomb. Hydrogen gas evolves as the solid dissolves. O²-2′-anhydro-5-methyluridine (1.2 g, 5 mmol), and sodium bicarbonate (2.5 mg) are added and the bomb is sealed, placed in an oil bath and heated to 155° C. for 26 hours. The bomb is cooled to room temperature and opened. The crude solution is concentrated and the residue partitioned between water (200 mL) and hexanes (200 mL). The excess phenol is extracted into the hexane layer. The aqueous layer is extracted with ethyl acetate (3×200 mL) and the combined organic layers are washed once with water, dried over anhydrous sodium sulfate and concentrated. The residue is columned on silica gel using methanol/methylene chloride 1:20 (which has 2% triethylamine) as the eluent. As the column fractions are concentrated a colorless solid forms which is collected to give the title compound as a white solid.

5′-O-dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy) ethyl)]-5-methyl uridine

[0152] To 0.5 g (1.3 mmol) of 2′-O-[2(2-N,N-dimethylamino-ethoxy)ethyl)]-5-methyl uridine in anhydrous pyridine (8 mL), triethylamine (0.36 mL) and dimethoxytrityl chloride (DMT-Cl, 0.87 g, 2 eq.) are added and stirred for 1 hour. The reaction mixture is poured into water (200 mL) and extracted with CH₂Cl₂ (2×200 mL). The combined CH₂Cl₂ layers are washed with saturated NaHCO₃ solution, followed by saturated NaCl solution and dried over anhydrous sodium sulfate. Evaporation of the solvent followed by silica gel chromatography using MeOH:CH₂Cl₂:Et₃N (20:1, v/v, with 1% triethylamine) gives the title compound.

5′-O-Dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy)-ethyl)]-5-methyl uridine-3′-O-(cyanoethyl-N,N-diisopropyl)phosphoramidite

[0153] Diisopropylaminotetrazolide (0.6 g) and 2-cyanoethoxy-N,N-diisopropyl phosphoramidite (1.1 mL, 2 eq.) are added to a solution of 5′-O-dimethoxytrityl-2′-O-[2(2-N,N-dimethylaminoethoxy)ethyl)]-5-methyluridine (2.17 g, 3 mmol) dissolved in CH₂Cl₂ (20 mL) under an atmosphere of argon. The reaction mixture is stirred overnight and the solvent evaporated. The resulting residue is purified by silica gel flash column chromatography with ethyl acetate as the eluent to give the title compound.

Example 2 Oligonucleotide Synthesis

[0154] Unsubstituted and substituted phosphodiester (P═O) oligonucleotides are synthesized on an automated DNA synthesizer (Applied Biosystems model 380B) using standard phosphoramidite chemistry with oxidation by iodine.

[0155] Phosphorothioates (P═S) are synthesized as for the phosphodiester oligonucleotides except the standard oxidation bottle was replaced by 0.2 M solution of 3H-1,2-benzodithiole-3-one 1,1-dioxide in acetonitrile for the stepwise thiation of the phosphite linkages. The thiation wait step was increased to 68 sec and was followed by the capping step. After cleavage from the CPG column and deblocking in concentrated ammonium hydroxide at 55° C. (18 h), the oligonucleotides were purified by precipitating twice with 2.5 volumes of ethanol from a 0.5 M NaCl solution.

[0156] Phosphinate oligonucleotides are prepared as described in U.S. Pat. No. 5,508,270, herein incorporated by reference.

[0157] Alkyl phosphonate oligonucleotides are prepared as described in U.S. Pat. No. 4,469,863, herein incorporated by reference.

[0158] 3′-Deoxy-3′-methylene phosphonate oligonucleotides are prepared as described in U.S. Pat. Nos. 5,610,289 or 5,625,050, herein incorporated by reference.

[0159] Phosphoramidite oligonucleotides are prepared as described in U.S. Pat. No. 5,256,775 or U.S. Pat. No. 5,366,878, herein incorporated by reference.

[0160] Alkylphosphonothioate oligonucleotides are prepared as described in published PCT applications PCT/US94/00902 and PCT/US93/06976 (published as WO 94/17093 and WO 94/02499, respectively), herein incorporated by reference. 3′-Deoxy-3′-amino phosphoramidate oligonucleotides are prepared as described in U.S. Pat. No. 5,476,925, herein incorporated by reference.

[0161] Phosphotriester oligonucleotides are prepared as described in U.S. Pat. No. 5,023,243, herein incorporated by reference.

[0162] Borano phosphate oligonucleotides are prepared as described in U.S. Pat. Nos. 5,130,302 and 5,177,198, both herein incorporated by reference.

Example 3 Oligonucleoside Synthesis

[0163] Methylenemethylimino linked oligonucleosides, also identified as MMI linked oligonucleosides, methylenedimethylhydrazo linked oligonucleosides, also identified as MDH linked oligonucleosides, and methylenecarbonylamino linked oligonucleosides, also identified as amide-3 linked oligonucleosides, and methyleneaminocarbonyl linked oligonucleosides, also identified as amide-4 linked oligonucleosides, as well as mixed backbone compounds having, for instance, alternating MMI and P═O or P═S linkages are prepared as described in U.S. Pat. Nos. 5,378,825, 5,386,023, 5,489,677, 5,602,240 and 5,610,289, all of which are herein incorporated by reference.

[0164] Formacetal and thioformacetal linked oligonucleosides are prepared as described in U.S. Pat. Nos. 5,264,562 and 5,264,564, herein incorporated by reference.

[0165] Ethylene oxide linked oligonucleosides are prepared as described in U.S. Pat. No. 5,223,618, herein incorporated by reference.

Example 4 PNA Synthesis

[0166] Peptide nucleic acids (PNAs) are prepared in accordance with any of the various procedures referred to in Peptide Nucleic Acids (PNA): Synthesis, Properties and Potential Applications, Bioorganic & Medicinal Chemistry, 1996, 4, 5-23. They may also be prepared in accordance with U.S. Pat. Nos. 5,539,082, 5,700,922, and 5,719,262, herein incorporated by reference.

Example 5 Synthesis of Chimeric Oligonucleotides

[0167] Chimeric oligonucleotides, oligonucleosides or mixed oligonucleotides/oligonucleosides of the invention can be of several different types. These include a first type wherein the “gap” segment of linked nucleosides is positioned between 5′ and 3′ “wing” segments of linked nucleosides and a second “open end” type wherein the “gap” segment is located at either the 3′ or the 5′ terminus of the oligomeric compound. Oligonucleotides of the first type are also known in the art as “gapmers” or gapped oligonucleotides. Oligonucleotides of the second type are also known in the art as “hemimers” or “wingmers”.

[2′-O-Me]—[2′-deoxy]—[2′-O-Me] Chimeric Phosphorothioate Oligonucleotides

[0168] Chimeric oligonucleotides having 2′-O-alkyl phosphorothioate and 2′-deoxy phosphorothioate oligonucleotide segments are synthesized using an Applied Biosystems automated DNA synthesizer Model 380B, as above. Oligonucleotides are synthesized using the automated synthesizer and 2′-deoxy-5′-dimethoxytrityl-3′-O-phosphoramidite for the DNA portion and 5′-dimethoxytrityl-2′-O-methyl-3′-O-phosphoramidite for 5′ and 3′ wings. The standard synthesis cycle is modified by increasing the wait step after the delivery of tetrazole and base to 600 s repeated four times for RNA and twice for 2′-O-methyl. The fully protected oligonucleotide is cleaved from the support and the phosphate group is deprotected in 3:1 ammonia/ethanol at room temperature overnight then lyophilized to dryness. Treatment in methanolic ammonia for 24 hrs at room temperature is then done to deprotect all bases and sample was again lyophilized to dryness. The pellet is resuspended in 1M TBAF in THF for 24 hrs at room temperature to deprotect the 2′ positions. The reaction is then quenched with 1M TEAA and the sample is then reduced to ½ volume by rotovac before being desalted on a G25 size exclusion column. The oligo recovered is then analyzed spectrophotometrically for yield and for purity by capillary electrophoresis and by mass spectrometry.

[2′-O-(2-Methoxyethyl)]—[2′-deoxy]—[2′-O-(Methoxyethyl)] Chimeric Phosphorothioate Oligonucleotides

[0169] [2′-O-(2-methoxyethyl)]—[2′-deoxy]—[-2′-O-(methoxyethyl)] chimeric phosphorothioate oligonucleotides were prepared as per the procedure above for the 2′-O-methyl chimeric oligonucleotide, with the substitution of 2′-O-(methoxyethyl) amidites for the 2′-O-methyl amidites.

[2′-O-(2-Methoxyethyl)Phosphodiester]—[2′-deoxy Phosphorothioate]—[2′-O-(2-Methoxyethyl) Phosphodiester] Chimeric Oligonucleotides

[0170] [2′-O-(2-methoxyethyl phosphodiester]—[2′-deoxy phosphorothioate]—[2′-O-(methoxyethyl) phosphodiester] chimeric oligonucleotides are prepared as per the above procedure for the 2′-O-methyl chimeric oligonucleotide with the substitution of 2′-O-(methoxyethyl) amidites for the 2′-O-methyl amidites, oxidization with iodine to generate the phosphodiester internucleotide linkages within the wing portions of the chimeric structures and sulfurization utilizing 3, H-1,2 benzodithiole-3-one 1,1 dioxide (Beaucage Reagent) to generate the phosphorothioate internucleotide linkages for the center gap.

[0171] Other chimeric oligonucleotides, chimeric oligonucleo- sides and mixed chimeric oligonucleotides/oligonucleosides are synthesized according to U.S. Pat. No. 5,623,065, herein incorporated by reference.

Example 6 Oligonucleotide Isolation

[0172] After cleavage from the controlled pore glass column (Applied Biosystems) and deblocking in concentrated ammonium hydroxide at 55° C. for 18 hours, the oligonucleotides or oligonucleosides are purified by precipitation twice out of 0.5 M NaCl with 2.5 volumes ethanol. Synthesized oligonucleotides were analyzed by polyacrylamide gel electrophoresis on denaturing gels and judged to be at least 85% full length material. The relative amounts of phosphorothioate and phosphodiester linkages obtained in synthesis were periodically checked by ³¹P nuclear magnetic resonance spectroscopy, and for some studies oligonucleotides were purified by HPLC, as described by Chiang et al., J. Biol. Chem. 1991, 266, 18162-18171. Results obtained with HPLC-purified material were similar to those obtained with non-HPLC purified material.

Example 7 Oligonucleotide Synthesis—96 Well Plate Format

[0173] Oligonucleotides were synthesized via solid phase P(III) phosphoramidite chemistry on an automated synthesizer capable of assembling 96 sequences simultaneously in a standard 96 well format. Phosphodiester internucleotide linkages were afforded by oxidation with aqueous iodine. Phosphorothioate internucleotide linkages were generated by sulfurization utilizing 3,H-1,2 benzodithiole-3-one 1,1 dioxide (Beaucage Reagent) in anhydrous acetonitrile. Standard base-protected beta-cyanoethyldiisopropyl phosphoramidites were purchased from commercial vendors (e.g. PE-Applied Biosystems, Foster City, Calif., or Pharmacia, Piscataway, N.J.). Non-standard nucleosides are synthesized as per known literature or patented methods. They are utilized as base protected beta-cyanoethyldiisopropyl phosphoramidites.

[0174] Oligonucleotides were cleaved from support and deprotected with concentrated NH₄OH at elevated temperature (55-60° C.) for 12-16 hours and the released product then dried in vacuo. The dried product was then re-suspended in sterile water to afford a master plate from which all analytical and test plate samples are then diluted utilizing robotic pipettors.

Example 8 Oligonucleotide Analysis—96 Well Plate Format

[0175] The concentration of oligonucleotide in each well was assessed by dilution of samples and UV absorption spectroscopy. The full-length integrity of the individual products was evaluated by capillary electrophoresis (CE) in either the 96 well format (Beckman P/ACE™ MDQ) or, for individually prepared samples, on a commercial CE apparatus (e.g., Beckman P/ACE™ 5000, ABI 270). Base and backbone composition was confirmed by mass analysis of the compounds utilizing electrospray-mass spectroscopy. All assay test plates were diluted from the master plate using single and multi-channel robotic pipettors. Plates were judged to be acceptable if at least 85% of the compounds on the plate were at least 85% full length.

Example 9 Cell Culture and Oligonucleotide Treatment

[0176] The effect of antisense compounds on target nucleic acid expression can be tested in any of a variety of cell types provided that the target nucleic acid is present at measurable levels. This can be routinely determined using, for example, PCR or Northern blot analysis. The following 4 cell types are provided for illustrative purposes, but other cell types can be routinely used, provided that the target is expressed in the cell type chosen. This can be readily determined by methods routine in the art, for example Northern blot analysis, Ribonuclease protection assays, or RT-PCR.

T-24 Cells

[0177] The human transitional cell bladder carcinoma cell line T-24 was obtained from the American Type Culture Collection (ATCC) (Manassas, Va.). T-24 cells were routinely cultured in complete McCoy's 5A basal media (Gibco/Life Technologies, Gaithersburg, Md.) supplemented with 10% fetal calf serum (Gibco/Life Technologies, Gaithersburg, Md.), penicillin 100 units per mL, and streptomycin 100 micrograms per mL (Gibco/Life Technologies, Gaithersburg, Md.). Cells were routinely passaged by trypsinization and dilution when they reached 90% confluence. Cells were seeded into 96-well plates (Falcon-Primaria #3872) at a density of 7000 cells/well for use in RT-PCR analysis.

[0178] For Northern blotting or other analysis, cells may be seeded onto 100 mm or other standard tissue culture plates and treated similarly, using appropriate volumes of medium and oligonucleotide.

A549 Cells

[0179] The human lung carcinoma cell line A549 was obtained from the American Type Culture Collection (ATCC) (Manassas, Va.). A549 cells were routinely cultured in DMEM basal media (Gibco/Life Technologies, Gaithersburg, Md.) supplemented with 10% fetal calf serum (Gibco/Life Technologies, Gaithersburg, Md.), penicillin 100 units per mL, and streptomycin 100 micrograms per mL (Gibco/Life Technologies, Gaithersburg, Md.). Cells were routinely passaged by trypsinization and dilution when they reached 90% confluence.

NHDF Cells

[0180] Human neonatal dermal fibroblast (NHDF) were obtained from the Clonetics Corporation (Walkersville Md.). NHDFs were routinely maintained in Fibroblast Growth Medium (Clonetics Corporation, Walkersville Md.) supplemented as recommended by the supplier. Cells were maintained for up to 10 passages as recommended by the supplier.

HEK Cells

[0181] Human embryonic keratinocytes (HEK) were obtained from the Clonetics Corporation (Walkersville Md.). HEKs were routinely maintained in Keratinocyte Growth Medium (Clonetics Corporation, Walkersville Md.) formulated as recommended by the supplier. Cells were routinely maintained for up to 10 passages as recommended by the supplier.

Treatment with Antisense Compounds

[0182] When cells reached 80% confluency, they were treated with oligonucleotide. For cells grown in 96-well plates, wells were washed once with 200 μL OPTI-MEM™-1 reduced-serum medium (Gibco BRL) and then treated with 130 μL of OPTI-MEM™-1 containing 3.75 μg/mL LIPOFECTIN™ (Gibco BRL) and the desired concentration of oligonucleotide. After 4-7 hours of treatment, the medium was replaced with fresh medium. Cells were harvested 16-24 hours after oligonucleotide treatment.

[0183] The concentration of oligonucleotide used varies from cell line to cell line. To determine the optimal oligonucleotide concentration for a particular cell line, the cells are treated with a positive control oligonucleotide at a range of concentrations. For human cells the positive control oligonucleotide is ISIS 13920, TCCGTCATCGCTCCTCAGGG, SEQ ID NO: 1, a 2′-O-methoxyethyl gapmer (2′-O-methoxyethyls shown in bold) with a phosphorothioate backbone which is targeted to human H-ras. For mouse or rat cells the positive control oligonucleotide is ISIS 15770, ATGCATTCTGCCCCCAAGGA, SEQ ID NO: 2, a 2′-O-methoxyethyl gapmer (2′-O-methoxyethyls shown in bold) with a phosphorothioate backbone which is targeted to both mouse and rat c-raf. The concentration of positive control oligonucleotide that results in 80% inhibition of c-Ha-ras (for ISIS 13920) or c-raf (for ISIS 15770) mRNA is then utilized as the screening concentration for new oligonucleotides in subsequent experiments for that cell line. If 80% inhibition is not achieved, the lowest concentration of positive control oligonucleotide that results in 60% inhibition of H-ras or c-raf mRNA is then utilized as the oligonucleotide screening concentration in subsequent experiments for that cell line. If 60% inhibition is not achieved, that particular cell line is deemed as unsuitable for oligonucleotide transfection experiments.

Example 10 Analysis of Oligonucleotide Inhibition of NAC Expression

[0184] Antisense modulation of NAC expression can be assayed in a variety of ways known in the art. For example, NAC mRNA levels can be quantitated by, e.g., Northern blot analysis, competitive polymerase chain reaction (PCR), or real-time PCR (RT-PCR). Real-time quantitative PCR is presently preferred. RNA analysis can be performed on total cellular RNA or poly(A)+ mRNA. Methods of RNA isolation are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.1.1-4.2.9 and 4.5.1-4.5.3, John Wiley & Sons, Inc., 1993. Northern blot analysis is routine in the art and is taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.2.1-4.2.9, John Wiley & Sons, Inc., 1996. Real-time quantitative (PCR) can be conveniently accomplished using the commercially available ABI PRISM™ 7700 Sequence Detection System, available from PE-Applied Biosystems, Foster City, Calif. and used according to manufacturer's instructions.

[0185] Protein levels of NAC can be quantitated in a variety of ways well known in the art, such as immunoprecipitation, Western blot analysis (immunoblotting), ELISA or fluorescence-activated cell sorting (FACS). Antibodies directed to NAC can be identified and obtained from a variety of sources, such as the MSRS catalog of antibodies (Aerie Corporation, Birmingham, Mich.), or can be prepared via conventional antibody generation methods. Methods for preparation of polyclonal antisera are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 11.12.1-11.12.9, John Wiley & Sons, Inc., 1997. Preparation of monoclonal antibodies is taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 11.4.1-11.11.5, John Wiley & Sons, Inc., 1997.

[0186] Immunoprecipitation methods are standard in the art and can be found at, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 10.16.1-10.16.11, John Wiley & Sons, Inc., 1998. Western blot (immunoblot) analysis is standard in the art and can be found at, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 10.8.1-10.8.21, John Wiley & Sons, Inc., 1997. Enzyme-linked immunosorbent assays (ELISA) are standard in the art and can be found at, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 2, pp. 11.2.1-11.2.22, John Wiley & Sons, Inc., 1991.

Example 11 Poly(A)+ mRNA Isolation

[0187] Poly(A)+ mRNA was isolated according to Miura et al., Clin. Chem., 1996, 42, 1758-1764. Other methods for poly(A)+ mRNA isolation are taught in, for example, Ausubel, F. M. et al., Current Protocols in Molecular Biology, Volume 1, pp. 4.5.1-4.5.3, John Wiley & Sons, Inc., 1993. Briefly, for cells grown on 96-well plates, growth medium was removed from the cells and each well was washed with 200 μL cold PBS. 60 μL lysis buffer (10 mM Tris-HCl, pH 7.6, 1 mM EDTA, 0.5 M NaCl, 0.5% NP-40, 20 mM vanadyl-ribonucleoside complex) was added to each well, the plate was gently agitated and then incubated at room temperature for five minutes. 55 μL of lysate was transferred to Oligo d(T) coated 96-well plates (AGCT Inc., Irvine Calif.). Plates were incubated for 60 minutes at room temperature, washed 3 times with 200 μL of wash buffer (10 mM Tris-HCl pH 7.6, 1 mM EDTA, 0.3 M NaCl). After the final wash, the plate was blotted on paper towels to remove excess wash buffer and then air-dried for 5 minutes. 60 μL of elution buffer (5 mM Tris-HCl pH 7.6), preheated to 70° C. was added to each well, the plate was incubated on a 90° C. hot plate for 5 minutes, and the eluate was then transferred to a fresh 96-well plate.

[0188] Cells grown on 100 mm or other standard plates may be treated similarly, using appropriate volumes of all solutions.

Example 12 Total RNA Isolation

[0189] Total RNA was isolated using an RNEASY 96™ kit and buffers purchased from Qiagen Inc. (Valencia Calif.) following the manufacturer's recommended procedures. Briefly, for cells grown on 96-well plates, growth medium was removed from the cells and each well was washed with 200 μL cold PBS. 100 μL Buffer RLT was added to each well and the plate vigorously agitated for 20 seconds. 100 μL of 70% ethanol was then added to each well and the contents mixed by pipetting three times up and down. The samples were then transferred to the RNEASY 96™ well plate attached to a QIAVAC™ manifold fitted with a waste collection tray and attached to a vacuum source. Vacuum was applied for 15 seconds. 1 mL of Buffer RW1 was added to each well of the RNEASY 96™ plate and the vacuum again applied for 15 seconds. 1 mL of Buffer RPE was then added to each well of the RNEASY 96™ plate and the vacuum applied for a period of 15 seconds. The Buffer RPE wash was then repeated and the vacuum was applied for an additional 10 minutes. The plate was then removed from the QIAVAC™ manifold and blotted dry on paper towels. The plate was then re-attached to the QIAVAC™ manifold fitted with a collection tube rack containing 1.2 mL collection tubes. RNA was then eluted by pipetting 60 μL water into each well, incubating 1 minute, and then applying the vacuum for 30 seconds. The elution step was repeated with an additional 60 μL water.

[0190] The repetitive pipetting and elution steps may be automated using a QIAGEN Bio-Robot 9604 (Qiagen, Inc., Valencia Calif.). Essentially, after lysing of the cells on the culture plate, the plate is transferred to the robot deck where the pipetting, DNase treatment and elution steps are carried out.

Example 13 Real-time Quantitative PCR Analysis of NAC mRNA Levels

[0191] Quantitation of NAC mRNA levels was determined by real-time quantitative PCR using the ABI PRISM™ 7700 Sequence Detection System (PE-Applied Biosystems, Foster City, Calif.) according to manufacturer's instructions. This is a closed-tube, non-gel-based, fluorescence detection system which allows high-throughput quantitation of polymerase chain reaction (PCR) products in real-time. As opposed to standard PCR, in which amplification products are quantitated after the PCR is completed, products in real-time quantitative PCR are quantitated as they accumulate. This is accomplished by including in the PCR reaction an oligonucleotide probe that anneals specifically between the forward and reverse PCR primers, and contains two fluorescent dyes. A reporter dye (e.g., JOE, FAM, or VIC, obtained from either Operon Technologies Inc., Alameda, Calif. or PE-Applied Biosystems, Foster City, Calif.) is attached to the 5′ end of the probe and a quencher dye (e.g., TAMRA, obtained from either Operon Technologies Inc., Alameda, Calif. or PE-Applied Biosystems, Foster City, Calif.) is attached to the 3′ end of the probe. When the probe and dyes are intact, reporter dye emission is quenched by the proximity of the 3′ quencher dye. During amplification, annealing of the probe to the target sequence creates a substrate that can be cleaved by the 5′-exonuclease activity of Taq polymerase. During the extension phase of the PCR amplification cycle, cleavage of the probe by Taq polymerase releases the reporter dye from the remainder of the probe (and hence from the quencher moiety) and a sequence-specific fluorescent signal is generated. With each cycle, additional reporter dye molecules are cleaved from their respective probes, and the fluorescence intensity is monitored at regular intervals by laser optics built into the ABI PRISM™ 7700 Sequence Detection System. In each assay, a series of parallel reactions containing serial dilutions of mRNA from untreated control samples generates a standard curve that is used to quantitate the percent inhibition after antisense oligonucleotide treatment of test samples.

[0192] Prior to quantitative PCR analysis, primer-probe sets specific to the target gene being measured are evaluated for their ability to be “multiplexed” with a GAPDH amplification reaction. In multiplexing, both the target gene and the internal standard gene GAPDH are amplified concurrently in a single sample. In this analysis, mRNA isolated from untreated cells is serially diluted. Each dilution is amplified in the presence of primer-probe sets specific for GAPDH only, target gene only (“single-plexing”), or both (multiplexing). Following PCR amplification, standard curves of GAPDH and target mRNA signal as a function of dilution are generated from both the single-plexed and multiplexed samples. If both the slope and correlation coefficient of the GAPDH and target signals generated from the multiplexed samples fall within 10% of their corresponding values generated from the single-plexed samples, the primer-probe set specific for that target is deemed multiplexable. Other methods of PCR are also known in the art.

[0193] PCR reagents were obtained from PE-Applied Biosystems, Foster City, Calif. RT-PCR reactions were carried out by adding 25 μL PCR cocktail (1×TAQMAN™ buffer A, 5.5 mM MgCl₂, 300 μM each of DATP, dCTP and dGTP, 600 μM of dUTP, 100 nM each of forward primer, reverse primer, and probe, 20 Units RNAse inhibitor, 1.25 Units AMPLITAQ GOLD™, and 12.5 Units MULV reverse transcriptase) to 96 well plates containing 25 μL total RNA solution. The RT reaction was carried out by incubation for 30 minutes at 48° C. Following a 10 minute incubation at 95° C. to activate the AMPLITAQ GOLD™, 40 cycles of a two-step PCR protocol were carried out: 95° C. for 15 seconds (denaturation) followed by 60° C. for 1.5 minutes (annealing/extension).

[0194] Gene target quantities obtained by real time RT-PCR are normalized using either the expression level of GAPDH, a gene whose expression is constant, or by quantifying total RNA using RiboGreen™ (Molecular Probes, Inc. Eugene, Oreg.). GAPDH expression is quantified by real time RT-PCR, by being run simultaneously with the target, multiplexing, or separately. Total RNA is quantified using RiboGreen™ RNA quantification reagent from Molecular Probes. Methods of RNA quantification by RiboGreen™ are taught in Jones, L. J., et al, Analytical Biochemistry, 1998, 265, 368-374.

[0195] In this assay, 175 μL of RiboGreen™ working reagent (RiboGreen™ reagent diluted 1:2865 in 10 mM Tris-HCl, 1 mM EDTA, pH 7.5) is pipetted into a 96-well plate containing 25 μL purified, cellular RNA. The plate is read in a CytoFluor 4000 (PE Applied Biosystems) with excitation at 480 nm and emission at 520 nm.

[0196] Probes and primers to human NAC were designed to hybridize to a human NAC sequence, using published sequence information (GenBank accession number NM_(—)014922, incorporated herein as SEQ ID NO:3). For human NAC the PCR primers were: forward primer: GTGGAGGAGAATCGAGGACATTT (SEQ ID NO: 4) reverse primer: CCAGTGTTGACTTCCCAATTCC (SEQ ID NO: 5) and the PCR probe was: FAM-CTGGATACCCAAGAACCTCGCATAGTCATACTG-TAMRA (SEQ ID NO: 6) where FAM (PE-Applied Biosystems, Foster City, Calif.) is the fluorescent reporter dye) and TAMRA (PE-Applied Biosystems, Foster City, Calif.) is the quencher dye. For human GAPDH the PCR primers were: forward primer: GAAGGTGAAGGTCGGAGTC (SEQ ID NO: 7) reverse primer: GAAGATGGTGATGGGATTTC (SEQ ID NO: 8) and the PCR probe was: 5′ JOE-CAAGCTTCCCGTTCTCAGCC-TAMRA 3′ (SEQ ID NO: 9) where JOE (PE-Applied Biosystems, Foster City, Calif.) is the fluorescent reporter dye) and TAMPA (PE-Applied Biosystems, Foster City, Calif.) is the quencher dye.

Example 14 Northern Blot Analysis of NAC mRNA Levels

[0197] Eighteen hours after antisense treatment, cell monolayers were washed twice with cold PBS and lysed in 1 mL RNAZOL™ (TEL-TEST “B” Inc., Friendswood, Tex.). Total RNA was prepared following manufacturer's recommended protocols. Twenty micrograms of total RNA was fractionated by electrophoresis through 1.2% agarose gels containing 1.1% formaldehyde using a MOPS buffer system (AMRESCO, Inc. Solon, Ohio). RNA was transferred from the gel to HYBOND™-N+ nylon membranes (Amersham Pharmacia Biotech, Piscataway, N.J.) by overnight capillary transfer using a Northern/Southern Transfer buffer system (TEL-TEST “B” Inc., Friendswood, Tex.). RNA transfer was confirmed by UV visualization. Membranes were fixed by UV cross-linking using a STRATALINKER™ UV Crosslinker 2400 (Stratagene, Inc, La Jolla, Calif.) and then probed using QUICKHYB™ hybridization solution (Stratagene, La Jolla, Calif.) using manufacturer's recommendations for stringent conditions.

[0198] To detect human NAC, a human NAC specific probe was prepared by PCR using the forward primer GTGGAGGAGAATCGAGGACATTT (SEQ ID NO: 4) and the reverse primer CCAGTGTTGACTTCCCAATTCC (SEQ ID NO: 5). To normalize for variations in loading and transfer efficiency membranes were stripped and probed for human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) RNA (Clontech, Palo Alto, Calif.).

[0199] Hybridized membranes were visualized and quantitated using a PHOSPHORIMAGER™ and IMAGEQUANT™ Software V3.3 (Molecular Dynamics, Sunnyvale, Calif.) . Data was normalized to GAPDH levels in untreated controls.

Example 15 Antisense Inhibition of Human NAC Expression by Chimeric Phosphorothioate Oligonucleotides having 2′-MOE Wings and a Deoxy Gap

[0200] In accordance with the present invention, a series of oligonucleotides were designed to target different regions of the human NAC RNA, using published sequences (GenBank accession number NM_(—)014922, incorporated herein as SEQ ID NO: 3, a concatenation of contiguous sequences from Genbank accession numbers AC055839 and AC001237, incorporated herein as SEQ ID NO: 10, exons 1-18 from SEQ ID NO: 10, incorporated herein as SEQ ID NO: 11, GenBank accession number AF298548, incorporated herein as SEQ ID NO: 12, and GenBank accession number AW978535, the complement of which is incorporated herein as SEQ ID NO: 13). The oligonucleotides are shown in Table 1. “Target site” indicates the first (5′-most) nucleotide number on the particular target sequence to which the oligonucleotide binds. All compounds in Table 1 are chimeric oligonucleotides (“gapmers”) 20 nucleotides in length, composed of a central “gap” region consisting of ten 2′-deoxynucleotides, which is flanked on both sides (5′ and 3′ directions) by five-nucleotide “wings”. The wings are composed of 2′-methoxyethyl (2′-MOE)nucleotides. The internucleoside (backbone) linkages are phosphorothioate (P═S) throughout the oligonucleotide. All cytidine residues are 5-methylcytidines. The compounds were analyzed for their effect on human NAC mRNA levels by quantitative real-time PCR as described in other examples herein. Data are averages from two experiments. If present, “N.D.” indicates “no data” TABLE 1 Inhibition of human NAC mRNA levels by chimeric phosphorothioate oligonucleotides having 2′-MOE wings and a deoxy gap TARGET SEQ ID TARGET SEQ ID ISIS # REGION NO SITE SEQUENCE % INHIB NO 156131 5′UTR 3 61 ggaatgggctttcagaaccc 47 14 156132 5′UTR 3 75 gagccgcagagcagggaatg 20 15 156133 5′UTR 3 169 ctgttctgtgttcctcagat 64 16 156134 5′UTR 3 334 gggagtggtaggaaaagcca 0 17 156135 5′UTR 3 406 ccgtctcttattcagcattc 23 18 156136 5′UTR 3 445 gcaggcagagaacagtgctg 58 19 156137 Start 3 503 ctctgtcccggagttaagag 69 20 Codon 156138 Coding 3 581 aagctggaactccttcagct 45 21 156139 Coding 3 717 ggagggctaggtcccaggcc 71 22 156140 Coding 3 883 cccgccggcaattcatggat 62 23 156141 Coding 3 912 aaacccttctctctgagccc 55 24 156142 Coding 3 961 gcagagatttctctccagcg 70 25 156143 Coding 3 1121 ccattgggtcccaggagcct 80 26 156144 Coding 3 1329 ggttaaaatcctcatttttc 57 27 156145 Coding 3 1610 gcagctgaagtagaagacat 50 28 156146 Coding 3 1625 ctgggccagctctctgcagc 71 29 156147 Coding 3 1872 tgatcaggaaggatgcctcg 61 30 156148 Coding 3 2499 acaggccatgtattccatat 44 31 156149 Coding 3 2679 gagtctcgtacaagcagtgg 64 32 156150 Coding 3 2870 ccacctgaacaggactacca 0 33 156151 Coding 3 2905 aagagaatctgccaataggc 61 34 156152 Coding 3 2929 tttctggtgaccttgaggac 66 35 156153 Coding 3 2996 ggtcttacaaagactcttca 63 36 156154 Coding 3 3177 gctgtctcagtctctggcaa 58 37 156155 Coding 3 3212 gctgaccagctgcagtcgct 0 38 156156 Coding 3 3383 ctggtccagccccaggcgta 29 39 156157 Coding 3 3412 tcctgcctcatctcatcact 46 40 156158 Coding 3 3419 cctcagttcctgcctcatct 24 41 156159 Coding 3 3426 ccagggccctcagttcctgc 69 42 156160 Coding 3 3473 acttggtttccgtctgctga 29 43 156161 Coding 3 3544 ctctgccgcttgagtgagga 54 44 156162 Coding 3 3701 cccttgagaggcaggagaag 59 45 156163 Coding 3 3846 gacccgtgttgggccagcgg 0 46 156164 Coding 3 3874 gtcaccgcttctctcatcac 46 47 156165 Coding 3 4011 caaagtgagggaggtgcaca 55 48 156166 Coding 3 4033 acatggcccccttggagagc 32 49 156167 Coding 3 4097 ggctggcttctccaggagca 59 50 156168 Coding 3 4223 gcggtggtaaagcaacacca 70 51 156169 Coding 3 4296 gctccagttccttccgaatg 0 52 156170 Coding 3 4307 tcgatagcagagctccagtt 64 53 156171 Coding 3 4405 accagagtctcatctttctt 59 54 156172 Coding 3 4432 tctcctggtttcaccaaggc 46 55 156173 Coding 3 4438 atgagatctcctggtttcac 26 56 156174 Coding 3 4483 gaaggtacggctatgcgggc 4 57 156175 Coding 3 4565 aacctccaccgatgtcactc 23 58 156176 Coding 3 4575 tgtccaagacaacctccacc 5 59 156177 Coding 3 4701 gtccatctttgcacttccgg 59 60 156178 Coding 3 4747 agttccataatgaggtgagg 5 61 156179 Coding 3 4789 ctgctgagtggcaggagtcc 12 62 156180 Stop 3 4801 ttgatacttcagctgctgag 16 63 Codon 156181 3′UTR 3 4832 aaagccaggactcaagggtc 43 64 156182 3′UTR 3 4908 actttagtgctggaaggcaa 53 65 156183 3′UTR 3 4956 ctggcatggacacataatgc 58 66 156184 3′UTR 3 5020 gctccagatggacattccct 81 67 156185 3′UTR 3 5075 cttggctgctgtggccacca 60 68 156186 3′UTR 3 5140 aggcaaatggttccatgtcc 47 69 156187 Intron 5 10 38815 atacagcatgtagccttttt 74 70 156188 Intron 5 10 45315 cttcttctattgcgcaatct 0 71 156189 Intron 5: 10 53422 gccagccaacctgtggaaga 51 72 Exon 6 156190 Exon 6: 10 53593 gaacaattactgcagtcgct 67 73 Intron 6 156191 Exon 8: 10 58593 taccgtctgctgaagatgag 51 74 Intron 8 156192 Intron 12 10 70616 tccaacttaccttcttgcta 38 75 156193 Exon 13: 10 73926 ttagtgtcaccttccgaatg 34 76 Intron 13 156194 Exon 13: 11 4299 catctatggccttccgaatg 30 77 Exon 14 156195 Exon 14 11 4316 gaatttcatttctagatcat 50 78 156196 Exon 14 11 4331 gattcgcacaaactggaatt 43 79 156197 Exon 14 11 4379 agtgtaacgacagcccatat 84 80 156198 Exon 14: 11 4435 caaagctccagttccttggg 31 81 Exon 15 156199 Exon 18 11 5604 gtctctgttgcacctgaggt 52 82 156200 Exon 18 11 5646 attggtactgccgcaggctg 30 83 156201 Exon 18 11 5654 ctggcttcattggtactgcc 27 84 156202 Exon 18 11 5765 catctcgcatatgattaaga 0 85 156203 Exon 18 11 5836 catctgtatgtgtacaacct 45 86 156204 Exon 18 11 6062 ccttttaaaagaaatattta 0 87 156205 Exon 18 11 6242 ctcggcagtctctgcgcagc 40 88 156206 Exon 18 11 6348 ggtccgccggtcctctcagc 25 89 156207 Start 12 148 ccagccatcttgttctgtgt 53 90 Codon 156208 Exon 16: 13 463 ggtgttccttcctatgcggg 0 91 Exon 18

[0201] As shown in Table 1, SEQ ID NOs 14, 16, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 34, 35, 36, 37, 40, 42, 44, 45, 47, 48, 50, 51, 53, 54, 55, 60, 64, 65, 66, 67, 68, 69, 70, 72, 73, 74, 78, 79, 80, 82, 86, 88 and 90 demonstrated at least 40% inhibition of human NAC expression in this assay and are therefore preferred. The target sites to which these preferred sequences are complementary are herein referred to as “active sites” and are therefore preferred sites for targeting by compounds of the present invention.

Example 16 Western Blot Analysis of NAC Protein Levels

[0202] Western blot analysis (immunoblot analysis) is carried out using standard methods. Cells are harvested 16-20 h after oligonucleotide treatment, washed once with PBS, suspended in Laemmli buffer (100 ul/well), boiled for 5 minutes and loaded on a 16% SDS-PAGE gel. Gels are run for 1.5 hours at 150 V, and transferred to membrane for western blotting. Appropriate primary antibody directed to NAC is used, with a radiolabelled or fluorescently labeled secondary antibody directed against the primary antibody species. Bands are visualized using a PHOSPHORIMAGER ™ (Molecular Dynamics, Sunnyvale Calif.).

1 91 1 20 DNA Artificial Sequence Antisense Oligonucleotide 1 tccgtcatcg ctcctcaggg 20 2 20 DNA Artificial Sequence Antisense Oligonucleotide 2 atgcattctg cccccaagga 20 3 5444 DNA Homo sapiens CDS (523)...(4812) 3 gccccagggc ctggagaggt ctgaagaaac ctgggagcca gcagcccggg gctccactct 60 gggttctgaa agcccattcc ctgctctgcg gctcctccca ccccacctct tctcagcctt 120 gcagctcaag ggttgatctc aggagtccag gacccaggag agggaagaat ctgaggaaca 180 cagaacagtg agcgttgccc acaccccatc tcccgtcacc acatctcccc tcaccctcac 240 cctccctgcc tggccctgga ccccatccca ggacctccct atcagctgac ttcttccagt 300 gtcttgcagg cccctctggg ctcctccctc ccctggcttt tcctaccact ccccctctat 360 cggcgtctat ctgtaggtgc cctgggattt ataaaactgg gttccgaatg ctgaataaga 420 gacggtaaga gccaaggcaa aggacagcac tgttctctgc ctgcctgata ccctcaccac 480 ctgggaacat cccccagaca ccctcttaac tccgggacag ag atg gct ggc gga 534 Met Ala Gly Gly 1 gcc tgg ggc cgc ctg gcc tgt tac ttg gag ttc ctg aag aag gag gag 582 Ala Trp Gly Arg Leu Ala Cys Tyr Leu Glu Phe Leu Lys Lys Glu Glu 5 10 15 20 ctg aag gag ttc cag ctt ctg ctc gcc aat aaa gcg cac tcc agg agc 630 Leu Lys Glu Phe Gln Leu Leu Leu Ala Asn Lys Ala His Ser Arg Ser 25 30 35 tct tcg ggt gag aca ccc gct cag cca gag aag acg agt ggc atg gag 678 Ser Ser Gly Glu Thr Pro Ala Gln Pro Glu Lys Thr Ser Gly Met Glu 40 45 50 gtg gcc tcg tac ctg gtg gct cag tat ggg gag cag cgg gcc tgg gac 726 Val Ala Ser Tyr Leu Val Ala Gln Tyr Gly Glu Gln Arg Ala Trp Asp 55 60 65 cta gcc ctc cat acc tgg gag cag atg ggg ctg agg tca ctg tgc gcc 774 Leu Ala Leu His Thr Trp Glu Gln Met Gly Leu Arg Ser Leu Cys Ala 70 75 80 caa gcc cag gaa ggg gca ggc cac tct ccc tca ttc ccc tac agc cca 822 Gln Ala Gln Glu Gly Ala Gly His Ser Pro Ser Phe Pro Tyr Ser Pro 85 90 95 100 agt gaa ccc cac ctg ggg tct ccc agc caa ccc acc tcc acc gca gtg 870 Ser Glu Pro His Leu Gly Ser Pro Ser Gln Pro Thr Ser Thr Ala Val 105 110 115 cta atg ccc tgg atc cat gaa ttg ccg gcg ggg tgc acc cag ggc tca 918 Leu Met Pro Trp Ile His Glu Leu Pro Ala Gly Cys Thr Gln Gly Ser 120 125 130 gag aga agg gtt ttg aga cag ctg cct gac aca tct gga cgc cgc tgg 966 Glu Arg Arg Val Leu Arg Gln Leu Pro Asp Thr Ser Gly Arg Arg Trp 135 140 145 aga gaa atc tct gcc tca ctc ctc tac caa gct ctt cca agc tcc cca 1014 Arg Glu Ile Ser Ala Ser Leu Leu Tyr Gln Ala Leu Pro Ser Ser Pro 150 155 160 gac cat gag tct cca agc cag gag tca ccc aac gcc ccc aca tcc aca 1062 Asp His Glu Ser Pro Ser Gln Glu Ser Pro Asn Ala Pro Thr Ser Thr 165 170 175 180 gca gtg ctg ggg agc tgg gga tcc cca cct cag ccc agc cta gca ccc 1110 Ala Val Leu Gly Ser Trp Gly Ser Pro Pro Gln Pro Ser Leu Ala Pro 185 190 195 aga gag cag gag gct cct ggg acc caa tgg cct ctg gat gaa acg tca 1158 Arg Glu Gln Glu Ala Pro Gly Thr Gln Trp Pro Leu Asp Glu Thr Ser 200 205 210 gga att tac tac aca gaa atc aga gaa aga gag aga gag aaa tca gag 1206 Gly Ile Tyr Tyr Thr Glu Ile Arg Glu Arg Glu Arg Glu Lys Ser Glu 215 220 225 aaa ggc agg ccc cca tgg gca gcg gtg gta gga acg ccc cca cag gcg 1254 Lys Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr Pro Pro Gln Ala 230 235 240 cac acc agc cta cag ccc cac cac cac cca tgg gag cct tct gtg aga 1302 His Thr Ser Leu Gln Pro His His His Pro Trp Glu Pro Ser Val Arg 245 250 255 260 gag agc ctc tgt tcc aca tgg ccc tgg aaa aat gag gat ttt aac caa 1350 Glu Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu Asp Phe Asn Gln 265 270 275 aaa ttc aca cag ctg cta ctt cta caa aga cct cac ccc aga agc caa 1398 Lys Phe Thr Gln Leu Leu Leu Leu Gln Arg Pro His Pro Arg Ser Gln 280 285 290 gat ccc ctg gtc aag aga agc tgg cct gat tat gtg gag gag aat cga 1446 Asp Pro Leu Val Lys Arg Ser Trp Pro Asp Tyr Val Glu Glu Asn Arg 295 300 305 gga cat tta att gag atc aga gac tta ttt ggc cca ggc ctg gat acc 1494 Gly His Leu Ile Glu Ile Arg Asp Leu Phe Gly Pro Gly Leu Asp Thr 310 315 320 caa gaa cct cgc ata gtc ata ctg cag ggg gct gct gga att ggg aag 1542 Gln Glu Pro Arg Ile Val Ile Leu Gln Gly Ala Ala Gly Ile Gly Lys 325 330 335 340 tca aca ctg gcc agg cag gtg aag gaa gcc tgg ggg aga ggc cag ctg 1590 Ser Thr Leu Ala Arg Gln Val Lys Glu Ala Trp Gly Arg Gly Gln Leu 345 350 355 tat ggg gac cgc ttc cag cat gtc ttc tac ttc agc tgc aga gag ctg 1638 Tyr Gly Asp Arg Phe Gln His Val Phe Tyr Phe Ser Cys Arg Glu Leu 360 365 370 gcc cag tcc aag gtg gtg agt ctc gct gag ctc atc gga aaa gat ggg 1686 Ala Gln Ser Lys Val Val Ser Leu Ala Glu Leu Ile Gly Lys Asp Gly 375 380 385 aca gcc act ccg gct ccc att aga cag atc ctg tct agg cca gag cgg 1734 Thr Ala Thr Pro Ala Pro Ile Arg Gln Ile Leu Ser Arg Pro Glu Arg 390 395 400 ctg ctc ttc atc ctc gat ggt gta gat gag cca gga tgg gtc ttg cag 1782 Leu Leu Phe Ile Leu Asp Gly Val Asp Glu Pro Gly Trp Val Leu Gln 405 410 415 420 gag ccg agt tct gag ctc tgt ctg cac tgg agc cag cca cag ccg gcg 1830 Glu Pro Ser Ser Glu Leu Cys Leu His Trp Ser Gln Pro Gln Pro Ala 425 430 435 gat gca ctg ctg ggc agt ttg ctg ggg aaa act ata ctt ccc gag gca 1878 Asp Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr Ile Leu Pro Glu Ala 440 445 450 tcc ttc ctg atc acg gct cgg acc aca gct ctg cag aac ctc att cct 1926 Ser Phe Leu Ile Thr Ala Arg Thr Thr Ala Leu Gln Asn Leu Ile Pro 455 460 465 tct ttg gag cag gca cgt tgg gta gag gtc ctg ggg ttc tct gag tcc 1974 Ser Leu Glu Gln Ala Arg Trp Val Glu Val Leu Gly Phe Ser Glu Ser 470 475 480 agc agg aag gaa tat ttc tac aga tat ttc aca gat gaa agg caa gca 2022 Ser Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp Glu Arg Gln Ala 485 490 495 500 att aga gcc ttt agg ttg gtc aaa tca aac aaa gag ctc tgg gcc ctg 2070 Ile Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu Leu Trp Ala Leu 505 510 515 tgt ctt gtg ccc tgg gtg tcc tgg ctg gcc tgc act tgc ctg atg cag 2118 Cys Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr Cys Leu Met Gln 520 525 530 cag atg aag cgg aag gaa aaa ctc aca ctg act tcc aag acc acc aca 2166 Gln Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser Lys Thr Thr Thr 535 540 545 acc ctc tgt cta cat tac ctt gcc cag gct ctc caa gct cag cca ttg 2214 Thr Leu Cys Leu His Tyr Leu Ala Gln Ala Leu Gln Ala Gln Pro Leu 550 555 560 gga ccc cag ctc aga gac ctc tgc tct ctg gct gct gag ggc atc tgg 2262 Gly Pro Gln Leu Arg Asp Leu Cys Ser Leu Ala Ala Glu Gly Ile Trp 565 570 575 580 caa aaa aag acc ctt ttc agt cca gat gac ctc agg aag cat ggg tta 2310 Gln Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg Lys His Gly Leu 585 590 595 gat ggg gcc atc atc tcc acc ttc ttg aag atg ggt att ctt caa gag 2358 Asp Gly Ala Ile Ile Ser Thr Phe Leu Lys Met Gly Ile Leu Gln Glu 600 605 610 cac ccc atc cct ctg agc tac agc ttc att cac ctc tgt ttc caa gag 2406 His Pro Ile Pro Leu Ser Tyr Ser Phe Ile His Leu Cys Phe Gln Glu 615 620 625 ttc ttt gca gca atg tcc tat gtc ttg gag gat gag aag ggg aga ggt 2454 Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu Lys Gly Arg Gly 630 635 640 aaa cat tct aat tgc atc ata gat ttg gaa aag acg cta gaa gca tat 2502 Lys His Ser Asn Cys Ile Ile Asp Leu Glu Lys Thr Leu Glu Ala Tyr 645 650 655 660 gga ata cat ggc ctg ttt ggg gca tca acc aca cgt ttc cta ttg ggc 2550 Gly Ile His Gly Leu Phe Gly Ala Ser Thr Thr Arg Phe Leu Leu Gly 665 670 675 ctg tta agt gat gag ggg gag aga gag atg gag aac atc ttt cac tgc 2598 Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn Ile Phe His Cys 680 685 690 cgg ctg tct cag ggg agg aac ctg atg cag tgg gtc ccg tcc ctg cag 2646 Arg Leu Ser Gln Gly Arg Asn Leu Met Gln Trp Val Pro Ser Leu Gln 695 700 705 ctg ctg ctg cag cca cac tct ctg gag tcc ctc cac tgc ttg tac gag 2694 Leu Leu Leu Gln Pro His Ser Leu Glu Ser Leu His Cys Leu Tyr Glu 710 715 720 act cgg aac aaa acg ttc ctg aca caa gtg atg gcc cat ttc gaa gaa 2742 Thr Arg Asn Lys Thr Phe Leu Thr Gln Val Met Ala His Phe Glu Glu 725 730 735 740 atg ggc atg tgt gta gaa aca gac atg gag ctc tta gtg tgc act ttc 2790 Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu Val Cys Thr Phe 745 750 755 tgc att aaa ttc agc cgc cac gtg aag aag ctt cag ctg att gag ggc 2838 Cys Ile Lys Phe Ser Arg His Val Lys Lys Leu Gln Leu Ile Glu Gly 760 765 770 agg cag cac aga tca aca tgg agc ccc acc atg gta gtc ctg ttc agg 2886 Arg Gln His Arg Ser Thr Trp Ser Pro Thr Met Val Val Leu Phe Arg 775 780 785 tgg gtc cca gtc aca gat gcc tat tgg cag att ctc ttc tcc gtc ctc 2934 Trp Val Pro Val Thr Asp Ala Tyr Trp Gln Ile Leu Phe Ser Val Leu 790 795 800 aag gtc acc aga aac ctg aag gag ctg gac cta agt gga aac tcg ctg 2982 Lys Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser Gly Asn Ser Leu 805 810 815 820 agc cac tct gca gtg aag agt ctt tgt aag acc ctg aga cgc cct cgc 3030 Ser His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu Arg Arg Pro Arg 825 830 835 tgc ctc ctg gag acc ctg cgg ttg gct ggc tgt ggc ctc aca gct gag 3078 Cys Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly Leu Thr Ala Glu 840 845 850 gac tgc aag gac ctt gcc ttt ggg ctg aga gcc aac cag acc ctg acc 3126 Asp Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn Gln Thr Leu Thr 855 860 865 gag ctg gac ctg agc ttc aat gtg ctc acg gat gct gga gcc aaa cac 3174 Glu Leu Asp Leu Ser Phe Asn Val Leu Thr Asp Ala Gly Ala Lys His 870 875 880 ctt tgc cag aga ctg aga cag ccg agc tgc aag cta cag cga ctg cag 3222 Leu Cys Gln Arg Leu Arg Gln Pro Ser Cys Lys Leu Gln Arg Leu Gln 885 890 895 900 ctg gtc agc tgt ggc ctc acg tct gac tgc tgc cag gac ctg gcc tct 3270 Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gln Asp Leu Ala Ser 905 910 915 gtg ctt agt gcc agc ccc agc ctg aag gag cta gac ctg cag cag aac 3318 Val Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp Leu Gln Gln Asn 920 925 930 aac ctg gat gac gtt ggc gtg cga ctg ctc tgt gag ggg ctc agg cat 3366 Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu Gly Leu Arg His 935 940 945 cct gcc tgc aaa ctc ata cgc ctg ggg ctg gac cag aca act ctg agt 3414 Pro Ala Cys Lys Leu Ile Arg Leu Gly Leu Asp Gln Thr Thr Leu Ser 950 955 960 gat gag atg agg cag gaa ctg agg gcc ctg gag cag gag aaa cct cag 3462 Asp Glu Met Arg Gln Glu Leu Arg Ala Leu Glu Gln Glu Lys Pro Gln 965 970 975 980 ctg ctc atc ttc agc aga cgg aaa cca agt gtg atg acc cct act gag 3510 Leu Leu Ile Phe Ser Arg Arg Lys Pro Ser Val Met Thr Pro Thr Glu 985 990 995 ggc ctg gat acg gga gag atg agt aat agc aca tcc tca ctc aag cgg 3558 Gly Leu Asp Thr Gly Glu Met Ser Asn Ser Thr Ser Ser Leu Lys Arg 1000 1005 1010 cag aga ctc gga tca gag agg gcg gct tcc cat gtt gct cag gct aat 3606 Gln Arg Leu Gly Ser Glu Arg Ala Ala Ser His Val Ala Gln Ala Asn 1015 1020 1025 ctc aaa ctc ctg gac gtg agc aag atc ttc cca att gct gag att gca 3654 Leu Lys Leu Leu Asp Val Ser Lys Ile Phe Pro Ile Ala Glu Ile Ala 1030 1035 1040 gag gaa agc tcc cca gag gta gta ccg gtg gaa ctc ttg tgc gtg cct 3702 Glu Glu Ser Ser Pro Glu Val Val Pro Val Glu Leu Leu Cys Val Pro 1045 1050 1055 1060 tct cct gcc tct caa ggg gac ctg cat acg aag cct ttg ggg act gac 3750 Ser Pro Ala Ser Gln Gly Asp Leu His Thr Lys Pro Leu Gly Thr Asp 1065 1070 1075 gat gac ttc tgg ggc ccc acg ggg cct gtg gct act gag gta gtt gac 3798 Asp Asp Phe Trp Gly Pro Thr Gly Pro Val Ala Thr Glu Val Val Asp 1080 1085 1090 aaa gaa aag aac ttg tac cga gtt cac ttc cct gta gct ggc tcc tac 3846 Lys Glu Lys Asn Leu Tyr Arg Val His Phe Pro Val Ala Gly Ser Tyr 1095 1100 1105 cgc tgg ccc aac acg ggt ctc tgc ttt gtg atg aga gaa gcg gtg acc 3894 Arg Trp Pro Asn Thr Gly Leu Cys Phe Val Met Arg Glu Ala Val Thr 1110 1115 1120 gtt gag att gaa ttc tgt gtg tgg gac cag ttc ctg ggt gag atc aac 3942 Val Glu Ile Glu Phe Cys Val Trp Asp Gln Phe Leu Gly Glu Ile Asn 1125 1130 1135 1140 cca cag cac agc tgg atg gtg gca ggg cct ctg ctg gac atc aag gct 3990 Pro Gln His Ser Trp Met Val Ala Gly Pro Leu Leu Asp Ile Lys Ala 1145 1150 1155 gag cct gga gct gtg gaa gct gtg cac ctc cct cac ttt gtg gct ctc 4038 Glu Pro Gly Ala Val Glu Ala Val His Leu Pro His Phe Val Ala Leu 1160 1165 1170 caa ggg ggc cat gtg gac aca tcc ctg ttc caa atg gcc cac ttt aaa 4086 Gln Gly Gly His Val Asp Thr Ser Leu Phe Gln Met Ala His Phe Lys 1175 1180 1185 gag gag ggg atg ctc ctg gag aag cca gcc agg gtg gag ctg cat cac 4134 Glu Glu Gly Met Leu Leu Glu Lys Pro Ala Arg Val Glu Leu His His 1190 1195 1200 ata gtt ctg gaa aac ccc agc ttc tcc ccc ttg gga gtc ctc ctg aaa 4182 Ile Val Leu Glu Asn Pro Ser Phe Ser Pro Leu Gly Val Leu Leu Lys 1205 1210 1215 1220 atg atc cat aat gcc ctg cgc ttc att ccc gtc acc tct gtg gtg ttg 4230 Met Ile His Asn Ala Leu Arg Phe Ile Pro Val Thr Ser Val Val Leu 1225 1230 1235 ctt tac cac cgc gtc cat cct gag gaa gtc acc ttc cac ctc tac ctg 4278 Leu Tyr His Arg Val His Pro Glu Glu Val Thr Phe His Leu Tyr Leu 1240 1245 1250 atc cca agt gac tgc tcc att cgg aag gaa ctg gag ctc tgc tat cga 4326 Ile Pro Ser Asp Cys Ser Ile Arg Lys Glu Leu Glu Leu Cys Tyr Arg 1255 1260 1265 agc cct gga gaa gac cag ctg ttc tcg gag ttc tac gtt ggc cac ttg 4374 Ser Pro Gly Glu Asp Gln Leu Phe Ser Glu Phe Tyr Val Gly His Leu 1270 1275 1280 gga tca ggg atc agg ctg caa gtg aaa gac aag aaa gat gag act ctg 4422 Gly Ser Gly Ile Arg Leu Gln Val Lys Asp Lys Lys Asp Glu Thr Leu 1285 1290 1295 1300 gtg tgg gag gcc ttg gtg aaa cca gga gat ctc atg cct gca act act 4470 Val Trp Glu Ala Leu Val Lys Pro Gly Asp Leu Met Pro Ala Thr Thr 1305 1310 1315 ctg atc cct cca gcc cgc ata gcc gta cct tca cct ctg gat gcc ccg 4518 Leu Ile Pro Pro Ala Arg Ile Ala Val Pro Ser Pro Leu Asp Ala Pro 1320 1325 1330 cag ttg ctg cac ttt gtg gac cag tat cga gag cag ctg ata gcc cga 4566 Gln Leu Leu His Phe Val Asp Gln Tyr Arg Glu Gln Leu Ile Ala Arg 1335 1340 1345 gtg aca tcg gtg gag gtt gtc ttg gac aaa ctg cat gga cag gtg ctg 4614 Val Thr Ser Val Glu Val Val Leu Asp Lys Leu His Gly Gln Val Leu 1350 1355 1360 agc cag gag cag tac gag agg gtg ctg gct gag aac acg agg ccc agc 4662 Ser Gln Glu Gln Tyr Glu Arg Val Leu Ala Glu Asn Thr Arg Pro Ser 1365 1370 1375 1380 cag atg cgg aag ctg ttc agc ttg agc cag tcc tgg gac cgg aag tgc 4710 Gln Met Arg Lys Leu Phe Ser Leu Ser Gln Ser Trp Asp Arg Lys Cys 1385 1390 1395 aaa gat gga ctc tac caa gcc ctg aag gag acc cat cct cac ctc att 4758 Lys Asp Gly Leu Tyr Gln Ala Leu Lys Glu Thr His Pro His Leu Ile 1400 1405 1410 atg gaa ctc tgg gag aag ggc agc aaa aag gga ctc ctg cca ctc agc 4806 Met Glu Leu Trp Glu Lys Gly Ser Lys Lys Gly Leu Leu Pro Leu Ser 1415 1420 1425 agc tga agtatcaaca ccagcccttg acccttgagt cctggctttg gctgaccctt 4862 Ser ctttgggtct cagtttcttt ctctgcaaac aagttgccat ctggtttgcc ttccagcact 4922 aaagtaatgg aactttgatg atgcctttgc tgggcattat gtgtccatgc cagggatgcc 4982 acagggggcc ccagtccagg tggcctaaca gcatctcagg gaatgtccat ctggagctgg 5042 caagacccct gcagacctca tagagcctca tctggtggcc acagcagcca agcctagagc 5102 cctccggatc ccatccaggc gcaaagagga ataggaggga catggaacca tttgcctctg 5162 gctgtgtcac agggtgagcc ccaaaattgg ggttcagcgt gggaggccac gtggattctt 5222 ggctttgtac aggaagatct acaagagcaa gccaacagag taaagtggaa ggaagtttat 5282 tcagaaaata aaggagtatc acagctcttt tagaatttgt ctagcaggct ttccagtttt 5342 taccagaaaa cccctataaa ttaaaaattt tttacttaaa tttaagaatt aaaaaaatac 5402 aaaaaagaaa aaatgaaaat aaaggaataa gaagttacct ac 5444 4 23 DNA Artificial Sequence PCR Primer 4 gtggaggaga atcgaggaca ttt 23 5 22 DNA Artificial Sequence PCR Primer 5 ccagtgttga cttcccaatt cc 22 6 33 DNA Artificial Sequence PCR Probe 6 ctggataccc aagaacctcg catagtcata ctg 33 7 19 DNA Artificial Sequence PCR Primer 7 gaaggtgaag gtcggagtc 19 8 20 DNA Artificial Sequence PCR Primer 8 gaagatggtg atgggatttc 20 9 20 DNA Artificial Sequence PCR Probe 9 caagcttccc gttctcagcc 20 10 96649 DNA Homo sapiens misc_feature (1)...(96649) n = A,T,C or G 10 gtgctgggat tacaggcatg aagcacattt ctgaacccaa atttttatat gtgtgatctc 60 ttcaacccag aaattccttt tctgggactc tgtcccttgg aggtactcac atgtgtgtga 120 taagatacac ttgtgtacga agttggttat caccatgtgg ctggtaacag gaaaatgcaa 180 atgaaagcca gctagatttc catcaaattt gactaatcaa ataaattata gtagatccaa 240 actatgaaaa actcaacaat tacaaaaaga gaagaaagaa cgacttagat ccatatgtac 300 taacatggag ggatctgcaa aatattttaa gaggaaaaaa attcactcct aaaaaactta 360 gatcctatat atttggcaca tagaaaagta actgtgttca tgtaaaactg tgtttttttt 420 ttcgtggaac actgaaacat tcatgtaaca ttggttattc ctgggaagta gatctgatgg 480 tgcagaaagt caaaagaaca tttttatttt tccccctaca tatttctgta ttatattttt 540 ataatcctaa taaatctaaa caacaagatg aaaatgtcat tttaaaagtt acatgtaatt 600 ggatagtagt attgcatcaa tgttttaagc ttcctgaatt ttatcattgt acagtggcat 660 ataagaaaac atcctgattc ttagtagata ctgccaaagt tcttagaagt aaatgttatg 720 ggccagttgt gatggctcat gcctgtaatg ccagcacttc gggatgccaa gggggggggg 780 ggtgcagatc acctgaggtc aggagttcga gaccagcctg gccaacatgg caaaacccca 840 tctctacaaa aatacaaaaa ttagccaggc atggtggcag gcacctgtaa tcccagctac 900 tagagaggct gaggcaggag aaacgcttga acccaggagg cagaggttgt agtgagctga 960 caatcgtgcc actgcactcc agcctgggca acagagcaag cttccatctc aggggaaaaa 1020 aaaaaaaaag aagtaaatgt tatcatatct gcaacttact cccaataatc agcaaacaca 1080 attaaaagat aaaacaatag taaatgttaa taattggtga atgtaggtta aggttatatg 1140 ggagttcact gtacaactct tgacattttc ctgtaagttt gaaatgtttt caaagtaaaa 1200 agggtctctg aaaagttatg catgacttcc tccgtttagc atagagacct gggtgtgcct 1260 tgcctggttt ctaagaggac cttattggtg ggtgttttag gcctataaag caggaacaag 1320 tcattgttct ttactgagat ttaaatgcag ttaagatcaa atgagaatac tcagagtgag 1380 agatgcgttc tgttgtaacc gatgtaaatt gactgtgacc tttggctatc gtgtgttctt 1440 ggaagatggt gatgcgggta tgtgtattct tgtccaagtt cacaggacat taaagcagac 1500 catggtaggg gcaacaccaa gtcctaatat ttccttcact tcacactgga ggaaagacta 1560 gaaggaggaa gaggagtaac agggagagga ggagggctgg atgaactaga ctatgttttt 1620 taacaaaaga tttcacctgt tgatgttcaa agcaccgttt tatctaagcc tgaaacacac 1680 caacttgggg gaaatgaatt gtcaaactat tgagatgatc ttgttgggtc cataacactg 1740 ttggatacag tgagttcctt ttcaaaggtt ccgcttgttc aattctctca ttctttgtcc 1800 tctattttca aagcctaact gttctgcctc cttgccccta gttacagtaa acaaccttcc 1860 agccgttccc aatctgtaac tcacatccat tcccagtctg taacccacat ccattcacaa 1920 tctgaaacaa cccacatctg taccttattt ggccaaaact gctcttcccg ctgctgtagc 1980 ccccacccct gctccatttg aagtagccaa tcaggatcag cttagactgc gcagtccaac 2040 tccagccaat ggggcccgga cacagcatca gggactgact gggtcaggga taaaaacccc 2100 ttccctcctt tgttcagtgt gctcttgcag tggccagaag ggcaagcgag acccttctcc 2160 agaagtaaat ttgccttgct gaaaaaatcc tttatttgag tgctcatttt ctctgtgact 2220 ccgagctttt atttccaaca ataccaagca cacacctgaa gggactagat atttccatgt 2280 cgatcatcct ggaaggagga taatcctggg gcctgtgcta ggagtgatgt gaatgtagga 2340 cagcttgttc tctttctaag tattagagcc agagggtcct ggcctccaaa gggcagagga 2400 ggctgcagaa tcttgaggct gggcaatgac ctgcctgtgt gtaatggaaa gcaccagagc 2460 tacagagggc tccagctgga aaagggatag cagggatgat cctggaccag agagggctct 2520 gaggacagtg tgagccacag ctacagagag tgatgccctg atgtatttac tatccttcat 2580 ccctgaacct tggggaagtt cttgctcaaa atcatatcct catgtcacgg tctcagctgt 2640 gccgtaaatt gggaaaatct tgtcccacat ttcaggaatg ggccagaaaa gagccgtgcc 2700 ctaaaatggt ggtgacagag aggacacctg ggcccagcaa cttcaggagg atagaaatgg 2760 ggtctgagac caaagggacg gcagcaaggt agtcaagacc agtcgatcag aggggaacca 2820 agaactgact tcagaacccc atgaggcaat gcctgggaca cgcttagaaa taggtgggag 2880 gagaggaatt tcaaggaggc tgatttcaag ttatccagat cttgattcaa gtcatagtaa 2940 tttgaaccta aagagaaggc ctcactgtac tctgctggag tactggccat acagatattc 3000 ctcatgagtt aagttttaat tgaaaatagc atgacttgaa taaaagtgag ttttaatgtg 3060 ttcctaaaat gcctcaagga aaggaaacaa aggcccaaag caccagaaac agggataatt 3120 ttaaaaattc aggctcagag agcgccagtg ggtcctctcc accaaaccca tgaggctgct 3180 tcaattctca gaaagtactt gagtgccatc tagtggccag agtgagtagc tgtttggaag 3240 gataagaagg aaagcattaa aggaaagcgt tcaaaagaca ttcaatgtct gtggtattga 3300 ttgtgctgac attgtgggcc aggcactgat ttaggcatag cggatacaac agcgaacaaa 3360 acaagccaag tcctgtctcc agggacctaa catcctattg gcaggagggg gtgacagaag 3420 gtaaacaaat aaacaagtct gtaacgtcag gcaataataa gtgtcatgcg gaaaaatgaa 3480 gcacggaatg aaggaattct actttacaga aggttggata gcacacctct gaatatctgg 3540 ggaaaccatt ccaagacaat ggaaagacta gtgcaaaggc cctggagtga gagtattctt 3600 ggtgtattca aggaacaggg aaaagactaa gaggatccag tgagggaggg ggatagggta 3660 ggatataatg ccggagagat atcaggggct agatcatgca gcccccggga accattggga 3720 gggctctggg agatttgggg agttgttgtt gttgttgttg ttgtttggtt tggtttggtt 3780 tgagactgtg ttttgctctt gtcgcccagg ctggagtgca atggtgcaat ctcggctcac 3840 tgcaatctct gcctcccagg ttcaagcagt tctcctgcct cagcctcctg aaggaatacg 3900 gcgtgcgcca ccacacccgg ataatttttt ttgtattata attattagag atgggatttc 3960 accatgttag ccaggctggt cttgaactcc tggcctcagg tgatctgccc gcctcagcct 4020 cccaaagtgc tgagattaca ggcatgagcc actgtacctg gccggatttg ggaagttttt 4080 taatattatg tcatagggac agggtcttgc catgtttccc agactggtct tgaactcctg 4140 gactcaagcg atcctcccac ctcagcctcc caaagtgctg ggattacagg cacgagtcac 4200 cacacctttg gttggacttt ggtttttact cagagtgaga tggagtatgg gaaagttttg 4260 agctgagaag tgacctgacc taatttatgc tttaagtgga taattctggc ttctagactg 4320 agagtagaca gtaaagggga agcatcagca tgggtggaga acatgtggaa ggctagcaca 4380 gtgatcccag taagagacga ctgaaggctg gaccagcagc cccagtgggc gtggtgggaa 4440 gggtcaggtt tctgatctat ttaacaggtt gagtggatct ccagcggcca gtgatttaat 4500 gaattatgcc tgtgaagtga aacctccgag aaaaccccac agtcttgttt gtttggggtt 4560 tggagagctt ccaggttagt gaagcaggga ggggcaggga gagtagggtg cccgggaggg 4620 caaggaagcg ctgtgcccct cccccaccca caccttgacc tcggcatctc ttccgttggg 4680 ctattcctga gtcgtatcct ccatcatgaa ccgataataa taaataaagt gcttttctca 4740 gtcctgtgag tcatttcagc aaattattga acctgccggg gactgtggga acccttgaat 4800 ttgtcaccaa gtcagacaga gatgccagtg gtctgggcac cccatttgca actggcatct 4860 gatatgaggc cagctcttca ctgaaaccct aaacctgtga ggtttgatgc caactccagg 4920 cagtcagtgt cagaattgca ctgtaggaca ccagttggtg tctagagagt tggtgccaga 4980 actggctggt gtgaggagga aaaaaagccc acacagccct cctccaggcc attcctcccg 5040 ctacatggag tctcacggtc cttcacggag tgctcagagc ctcccgatca tcccactttt 5100 agcacagcac ttttccactt ctggtcaatc tgatattaga atgtcctaca atgtgcttat 5160 tagccatttt ttaaaactaa catttataaa atttatttag gccaggcacg gaggctcaca 5220 cctgtaatcc caacatttcg ggaggccgaa gtgggcggat cacctcggat cagactggct 5280 aacatggtga aaccccatct ctactaaaaa tacaaaacaa ttagccaggc gtggtggtga 5340 gtgactgtaa tcccaactgc ccaggaagct gggctgatgc aggagaatcg ctggaaccca 5400 ggaggtggag ggagcttgca gtgagccgag atcttgccac tgcactccag cctgggaaag 5460 agagagagac tctaagacct tttgctgaat tgaagcctct tggtaaacct atgatgtccc 5520 atttaataga tgggcaaact gaggctcaaa gaaataaatc aggcacataa ggagtccttg 5580 tccaagcaag ccgtgggttg aagcctgggc tctgccagtc caagtgaaac caaagcatct 5640 ccagaaattc ctggtggtgt gttacatcag caaatgagta ggcaatgtca ttcctcaacc 5700 tgaagtcatt ggcatcttgc agattccacg aaagaggaag tctgcccctg gctccccgag 5760 gcctgcttgc aggaatgagg aaggggctgt tctgctttca ggtggaaatg ttcaccagct 5820 aagtgctggg ctgcccagaa ggcaaacagc cccctgggcc tgggacacat gctttagtgc 5880 ccctggtagc atggcaggac atgcagtgca actcaagaag ctcccaaaac catgagaggc 5940 tgggaaaaat gctccctggc ccagttctaa tgtttcccaa taaggagctg cccgggcaga 6000 cacagggggc cccacccatc ctgagaggct gggcctgggc tgcagacaca actgctcgga 6060 gagtctgcat cctggctctg cagacaggct ctagatgatg ggctggcgga gtgcccatgt 6120 ggaggtgcac acaccgtgtt tagtatgggc ctgtacctgc tgcagcgtca gctggtctgg 6180 gcctgtgctc ctggtccgcc tatagatgcc atgtggccca tctgaatctg tgactctgag 6240 tctggcctac ccaggcctgg tttgatccta ccccctggcc acgccccttg tgggtttcct 6300 ctgggtgacg tttccttcct gctcttgttg actaggcgct gttcttgctg gctggtgccc 6360 cagggcctgg agaggtctga agaaacctgg gagccagcag cccggggctc cactctgggt 6420 tctgaaagcc cattccctgc tctgcggctc ctcccacccc acctcttctc agccttgcag 6480 ctcaagggtt gatctcagga gtccaggacc caggagaggg aagaatctga ggaacacaga 6540 acagtgagcg ttgcccacac cccatctccc gtcaccacat ctcccctcac cctcaccctc 6600 cctgcctggc cctggacccc atcccaggac ctccctatca gctgacttct tccagtgtct 6660 tgcaggcccc tctgggctcc tccctcccct ggcttttcct accactcccc ctctatcggc 6720 gtctatctgt aggtgccctg ggatttataa aactgggttc cgaatgctga ataagagacg 6780 gtaagagcca aggcaaagga cagcactgtt ctctgcctgc ctgataccct caccacctgg 6840 gaacatcccc cagacaccct cttaactccg ggacagagat ggctggcgga gcctggggcc 6900 gcctggcctg ttacttggag ttcctgaaga aggaggagct gaaggagttc cagcttctgc 6960 tcgccaataa agcgcactcc aggagctctt cgggtgagac acccgctcag ccagagaaga 7020 cgagtggcat ggaggtggcc tcgtacctgg tggctcagta tggggagcag cgggcctggg 7080 acctagccct ccatacctgg gagcagatgg ggctgaggtc actgtgcgcc caagcccagg 7140 aaggggcagg tgagtggaca gaggacccca ctgcccgcgg gacatcccct ggccctcatg 7200 cctgccctgc cctcttggga ccccaccact gagccctccc tgccaggcct tcctctgagc 7260 agtgacatgg aactgccctc tcaaaggggg atccaggccc cagactatgc tggagccagg 7320 aggcttgctg ggggggccct tgctgcacct cagagcccca caccctcttc tattcaagtg 7380 cattcccggt actcgcctgg cattgtagtt atgagcacgg acttgggagt caggcactat 7440 caccagtcct ggcactatca ccctgggcca atcacttcac tgctctctgc ctatttctct 7500 gtctgcaaaa tgggagagtg agagcccgcc tccctgggct gtgtgagggt taaaagagat 7560 catccaggtt aaacatttag cacagagtct gatatacagt aagtgctcaa taaataagat 7620 ctcccattgc tgttgttgct gctgctgctg ctgctgttat tattactatt agccaccctg 7680 ggttgcagat gccgttggcc caataaagca cagctgaggc tcagtaggag tagtgattta 7740 cacagggaga tagtcaggac caagacagat atgcaaattt cctgatccta gatgaggatt 7800 ttttgcctag cagtatctat aagaacatca ttcttcagac aagcctctag aggaagttgt 7860 agagggctgc agcgggttca ctggtgtatc ctgacctctc ttggttgaga gaggagacca 7920 gctcagttcc ctggcacctt gcccctgccc ctctgatgtg tctccaacct ctttgtctct 7980 ttcctgtagg ccactctccc tcattcccct acagcccaag tgaaccccac ctggggtctc 8040 ccagccaacc cacctccacc gcagtgctaa tgccctggat ccatgaattg ccggcggggt 8100 gcacccaggg ctcagagaga agggttttga gacagctgcc tgacacatct ggacgccgct 8160 ggagaggtga ggcttgctga ggcaggctga cagccctggg tggagctgtg aatcagctgg 8220 tgggccccac ccagaggatc atgtctgtgc atggggacct gacatggggg acagcagagg 8280 aaggataggg gagccaggat gggctgtgag aggtaggcct gaaattgatc tttcagggac 8340 cttccggtgc tgagattgca tccctcccca tgtggcagat ggggaagctg agacccatgt 8400 gggtcaggca atgagtcttc cccaactggg tgcccaactc gtgacatggt ccttgcccga 8460 gaactcagct cataatattg atgggctctg aagatcctta gggtcagggt ggtccagaag 8520 ttccaaggag gactctgttt tccagacaga ccccaagaag aggctaggct tagtcctcgg 8580 cccaagccag ctccctcaca tgacttctct cttcactact aagagaataa aattgcccct 8640 ctacttcaac atggttttca aaataatatt attaagaccc atctctcagg actgttgtga 8700 gagtttaaaa tatatgcacc taaaagaaaa taaatgttca gtaaatggtt atttttcttt 8760 tcattccccc cagaaatctc tgcctcactc ctctaccaag ctcttccaag ctccccagac 8820 catgagtctc caagccagga gtcacccaac gcccccacat ccacagcagt gctggggagc 8880 tggggatccc cacctcagcc cagcctagca cccagagagc aggaggctcc tgggacccaa 8940 tggcctctgg atgaaacgtc aggaatttac tacacaggtg agaccctact gagctcctgg 9000 cggggtggtg aggtgggaga ggcaaatgta tggacaaaga gtcccctggg gaccaggttg 9060 gcaggctaag cagaggttct ggaaaacatc tgaaatttcc ctcaggaaat cccacccttg 9120 gagcctctcc actggtccaa ggcactgttt ccaggaggca cctggacatt aaaatacaga 9180 tgtgttaccc agcacatctg ggggacaaca gagcaaggag aggggagcct ggatggggtt 9240 acacatcacc tggtttactg gacagggcac tggcctgggg tcaggagagc tgggttcaat 9300 ttccactgag ctcctcaccc cacaagtggc ctacagctag tcccatgccc atcagtgctc 9360 actggacact ctctgggtta gaccatgtcc agggctttac gtgtacattg gacttaaatt 9420 ttacaaccac tccgtgagat aaatattgct attatcatac cctggagatg agaaaaccaa 9480 ggattaaaaa gatagtctca cagttagtac tagcaaagat ggaattcagc cttcactctc 9540 tctcacgcag caatccacac ttttaactct tacacaggga gtgccctcat ctgggcaata 9600 aggacagtca tactaaagcc acaggtacag aggctcctga tgccacaggt tccttaacca 9660 ctgaaccatg ctgcctctca atgccggagg atgcctcctt tgaaatctga gaggttttaa 9720 gttaaaagga tttcttctgc acccagcaag gagtgaactt gtagattttg tggtcgtata 9780 tggcctaggc tcaaagtata tcagtttact taattactca caccacttgc cattcaaaaa 9840 aagaagccat gtaacaaacc tgcacatata ccgcgtgagt ctaaaataaa aaataaataa 9900 actcatgttt taaaaatctt gatagggtat ataaggaaag atcaacattc gagttaaact 9960 aaggaaaaca gaaagtcaag aaaggaagat tctaatactt taagttgttt ctttctatat 10020 ttgagaatca caaagataac tttttctcag aattattttg tgaattaaaa aatccaaaca 10080 ataaagacaa ttaaaactga aaaacagaaa agaaggaaga taggaagaat ataaagacat 10140 atcagagcca tctatagaag cataaaaatg cagtaacttt aaaagtgaaa aaaacaagta 10200 tgaaaaccac aaagactacc tcattgctat aaataaagtt tccatcaatt gtgggcattc 10260 tagcaacact ggcaaaaggg gaaatataat gggttacaca atacctgtta ctcaatagaa 10320 aagaccaatg caagggaaat aacatttttc cctgattcta aattctaagg gaattcttca 10380 aatagggaac ttaaaaaaaa aacaacaaca aacttttatt ttagtttcag aaacacatgt 10440 gcaggtttgt tatataggta agttatgtgt cacaagtgtt tggtgtacag attatcttgt 10500 cgcccaggta ataagcatgg tacctgttag gttgggtttt ttgtttgttt ttgtttttgt 10560 tttttgtttg ttgttttttg ttttttgttg tgttttgttt tgagacagag tcttgctcgg 10620 tcacccaggc tggagtgcag tggcgtgatc tccactcact gcaagctctg cctcccgggt 10680 tcacgccatt ctcctgcctc agcctcccga gtcgctggga ctacaggtgc ccgccacgaa 10740 gccctgctaa ttttttgtat ttttagtaga gatggggttt caccatgtta gtcaggctgg 10800 tctcgaactc ctgatccgag gtgatccacc cacctcagcc acccaaagtg ctaggattac 10860 aggtgtgagc caccgcgccc ggcccggtag ttttttaatc cttacccttc tcccactctt 10920 cactctcaaa aggtcccagt gtctattgtt ccttttatgt gtccatgtgt acgcaatgtt 10980 tacctctaac ttatgaatga gaacatgcgg tatttcattg tctgttcctg tgttagttca 11040 cttataatgc tctccaactc cattcaagtt gctgcaaagg catgacctca ttcattttta 11100 tggctgtgta gtattccatg tgtaaatgtc ccacattttc tttagcagta tggttgattc 11160 catactggtg ctattgtgac tagtgctgca ataaacatac atgtacatgc atctttatgg 11220 tagaatgctc tacattcctt tgggtatata cccaataatg ggattgctag attgaatgat 11280 agttcagagt tctttggcaa atcgcgaaat tcctttccac agtggctgaa ctaatttaca 11340 ttcttaccag cagtgtataa gtgttcctgt ttctctgcaa tctcaccaat atctgttatt 11400 ttttgacttt taaattatag ccattctgac tggtgtagaa tagtacctca ttatgggttt 11460 gatttgcatt tttctttttt tccagagaag gtctcttttt ttattctatt ctggccttca 11520 actgatggaa tgagagccac ccacatcagg gagggtgatc ttattttttt tttttttaat 11580 actttaagtt ctagggtaca tgtgcacaac atgcaggttt gttacatgtg tatgcatgtg 11640 ccatgctggt gtgctgcacc cattaactca tcatttacat taggtatatc tcctaatgct 11700 atccctcccc cctcctccaa ccccacgaca ggccctggtg tgtgatgttg cccttcctgt 11760 gtccaggtgt tctcattgtt catttcccac ctatgagtga gaacatgcgg tgtttggttg 11820 tttgtccctg cgatagtttg ctgagaatga tggtttccag cttcatccat ttccctacaa 11880 aggacatgaa ctcatccttt tttatggctg catagtattc catggtgtat atgtgccact 11940 ttttcttaat ccagtctatc attgatggac atttgggttg gttccaagtc tttgctattg 12000 taaatagtgc cacaataaac atacgtgtgc atgtgtcttt atagcagcat gatttataat 12060 cctttgggta tatacccagt aatgggatgg ctgggtcaaa tggtatttct agttctagat 12120 ccttgaggaa ttgccacact gtcttccaca atggttgaac cagtttacag tcccaccaac 12180 agtgtaaaag tgttcctatt tctccacatc ctctccagca cctgttgttt cctgactttt 12240 taatattcac cattctaact tgtatgagat ggtatcccat tgtggttttg atttgcattt 12300 ctctaatggc cagtgatgat gagcattttt tcatatgtct gttggctaca taaatgtctt 12360 cttttgagaa gtgtctgttc atatccttca cccacttttt gatggagttg tttgtttttt 12420 tcttgtaaat ttgtttgagt tctttgtaga ttctggatat tagccctttg ttagatgagt 12480 agattgcaaa aattttgtcc cattctgtag gttgcctgtt cactctgatg gtagtttctt 12540 ttgctgtgca gaagctcttt agtttaatta gatctcatat gtcaattttg gcttttgttg 12600 ccattgcttt gaatgtttta gacatgaagt ccttgcccat gcctatgtcc tgaatggtat 12660 tgcctaggtt ttcttctagg gtttttatgg ttttaggtct gacatttaag tctttaatcc 12720 atcttgaatt aatttttgta taaggtgtaa ggaagggatc cagtttcagc tttctacata 12780 tggctaacca gttttcccag caccatttat taaataggga atcctttccc catttcttgt 12840 ttttgtcagg tttgtcaaag atcagatggt tgtagatgtg tggtattatt tctgagggct 12900 ctgttctgtt ccattggtct atatctctgt tttggtacca gtaccatgct gttttggtta 12960 ctgtagcctt gtagtatagt ttgaagtcag atagcatgat gcctccagct ttgttctttt 13020 ggcttaggat tgtcttggca gtgcgggctc ttttttggtt ccatatgaac ttttaagtcg 13080 ttttttccaa ttctgtgaag aaagtcattg gtagcttgat ggggatggca ttaaatctat 13140 aaattacctt gggcagtatg gccattttca tgatattgat tcttcctatc catgagcatg 13200 gaatgttctt ccatttgttt gtgtcctctt ttatttcatt gagcagtggt ttgtagttct 13260 ccttgaagag gtccttcaca tcctttgtaa gttggattgc taggtatttt attctctttg 13320 aagcaattgt gaatgcaagt tcactcaaga tttggttctc tgtctgttat tggtgtgtaa 13380 gaatgcttgt gacttttgca cattgatttt gtatcctgag acttcgctga agttgcttat 13440 cagcttaagg agattttggg ctgagacgat ggggttttct aaatattcag tcatgtcatc 13500 tgcaaacagg gacaatttga cttcctcttt tcctaattga atacgcttta tttctttctc 13560 ctgcctgatt gccctggcca gaacttccaa cactatgttg aataggagtg atgagagagg 13620 gcatccctgt cttgtgccag ttttcaaagg gaatgcttgc agtttttgcc cattcagtat 13680 gatattggct gtcggtttgc cataaatagc tcttattatt ttgagataca tcccatcaat 13740 acctaattta ttgagggttt ttagcatgta gggctgttga attttttcaa aggccttttc 13800 tgcatctatt gagataatca tgtggttttt gtctttggtt gtgtttatat gctggattgc 13860 atttattgat ttgcgtatgt tgaaccagcc ttagcatccc agggatgaag cccacttgat 13920 catgctggat aagctttttg atgtgctgct ggattcggtt tgccagtatt ttattgagga 13980 tttttgcatc aatgttcatc agggatattg gtctaaaatt cctttttttt tgttgtgtct 14040 ctgccaggct ttggtatcag gatgatgctt gcctcgtaaa atgagttagg gaggattcct 14100 tctttttcta ttgattggaa tagtttcaga aggaatggta ccagctcctc cttgtacgtc 14160 tggtagaatt tggctgtgaa tccgtctgga ttggtaggct attaattatt gcctcaattt 14220 cagagcctgt tattggtcta ttcagggatt caacttcttc ctggtttagt cttggaaggg 14280 tgtatgtgtc caggaatgtg tccatttctt ctagattttc tagtttattt gtgtagaggt 14340 gtttatagta ttctctgatg gtagtttgta tttctgtagg atcggtggtg atatcccctt 14400 tatcatttct tattgcatct atttgattct tctctctttt cttctttatt cgtcttgcta 14460 gcggtctatc aattttgttg atcttttcaa aaaaccagct cctggattca ttgatttttt 14520 gaaaggtttt ttgtgtctct gtctccttca gttctgctct gatcttagtt atttcttgcc 14580 ttctgctagc ttttgaatgt gtttgctctt gcttttctag ttctttctat tgtgatgtta 14640 gggtgtcaat tttagatctt tcctgctttc tcttgtgggc atttagtgct ataaatttcc 14700 ctctacacac agctttaaat gtgtcccaga gattctggta tgttgtgtct ttgttctcat 14760 tggtttcaaa gaacatcttt atttctgcct tcatttcgtt atgtacccag tagtcattca 14820 ggagcaggtt gttcagtttc catgtagttg agcggttttg agtgagtttc ttaatcctga 14880 gttctagttt gattgcactg aatggtctga gagacagttt gttataattt ctgtcctttt 14940 acatttgctg aggagtgctt tacttccaac tatgtggtca attttggaat aagcgcaatg 15000 tgtgctgaga agaatatata ttctgttgat ttggggtgga gagttctgta gatgtctatt 15060 aggtcccttg gtgcagagct gagttcaatt cctggatatc ctttttaact ttctgtctcg 15120 ttgatctgtc taatgttgac aggggggtgt taaagtctcc cattattatt gtgtgggagt 15180 cttaagtctc tttgtaggtc tctaaggtct tgctttatga atctgggtga tcctgtattg 15240 ggtgcatata tatttaggat agttagctct tcttgttgaa ttgatccctt taccattatg 15300 taatggcctt ctttgtctct tttagtcttt gttggtttaa agtctgtttt atcagagact 15360 aggattgcaa cccctgcctt tttttgtttt ccatttgctt ggtagatctt cctccatccc 15420 tttattttga gcctatgtgt gtctttgcac gtgagatggg tttcctgaat acagcacact 15480 gatgggtctt gactctttat cccatttccc tgtctgtgtc ttttaatcgg agcatttagc 15540 ccatttacat ttaaggttaa tattgttatg tgtgaatttg atcctgtcat tatgatgtta 15600 gctggttatt ttgctcatga gttgatgcag tttcttccta gcattggtgg tctttacagt 15660 ttggcatgtt tttgcagtgg ctgataccgg ttgttccttt ccatgtttag tgcttccttc 15720 aggaactctt ttagggcagg cctggtggtg acaaaatctc tcagcatttg cttgtctgta 15780 aaggatttta tttctccttc acttatgaag cttagtttgg ctggatatga aattctgggt 15840 tgaaaattct tttctttaag aaagttgaat attgccaggc gtggtggctc acacctgtaa 15900 tcccagcatt ttgggaggcc gagatgggca gaccatgagg tcaggagatc gagaccgtcc 15960 tggctaacat ggtgaaaccc catctctact aaaaaataca aaaaaaatta gccgggcgtg 16020 atggcgggca cctatagtcc cagctgctca ggaggctaag gcaggagaat ggcatgaacc 16080 tgggaggcgg agcttgcagt gagccgaggt tatgccactg cactccagcc tgggtgacag 16140 agcgagactc catctcaaaa aaaaaaaaaa aaaaagaatg ttgaatattg gcccccactc 16200 tcttctggct tgtagagttt ctgccgagag atccactgtt agtctgatgg gcttcccttt 16260 gtgggtaacc cgacctttct ctctggctgc ccttaacatt ttttccttca tttcagcttt 16320 ggtgaatctg acaattatgt gtcttggagt tgctcttctt aaggagtatc tttgtggtgt 16380 tctctgtagc tcctgaattt gaatgttggc ctgccttgct aggttgggga agttctcctg 16440 gataatatcc tgcagaatga tttccaactt ggttccattc tccccgtcac tttcaggtac 16500 accaatcaga tgtagatttg gtcttttcac atagtcccat atttcttgga ggctttgttc 16560 gtttcttttt actctttttt ctctaaactt ctcttcttcc ttcatttcat tcatttggtc 16620 ttcaatcact gatacccttt cttccagttg atcgaattgg ctactgaagc ttgtgcattc 16680 ttcacgtagt tctcgtgcca tggttttcag ctccatcagg tcatttaagg acttctctac 16740 actggttatt ttagttagcc atttgtctaa tcttttttca aggtttttag cttctttgtg 16800 atggattcga acttcctcct ttagctcgga gaagtttgat cttctgaagc cttcttctct 16860 caactcatca aagtcattct ccgtccagct ttgttccatt gctggcgagg agctatgttc 16920 ctttggaggg ggagaggcgc tctgattttt agaattttca gcttttctgg tcagtttttt 16980 ccccatcttt gtggttttat ctacctttgg tctttgacat aaagatgggg ttttggtgtg 17040 gatgtccttt atgtttgtta gttttccttc taacagtcag gaccctcagc tgcaggtctg 17100 ttggagtttg ctggaggtcc actccagacc ttgtttgcct gggtatcagc agcagaggct 17160 gaagaacagc gaatattgct aaacagcaaa tgttgctgcc tgatccttcc tctggaagct 17220 tcgtctcaga gaggtagacg gctgtgtgag gtgtcagtct gcccctactt gggggtgctt 17280 cccagttagg ctactcgggg gtcagggacc cacctgagga ggcagtctgt ctgttctcag 17340 atctcaaact ccatgctggg agaaccgctg ctctcttcaa agctgtcaaa cagggacatt 17400 taagtctgca gaggtttctg ctgccttttg tttggctatg ccctgccccc agaggtggag 17460 tctacagagg caggcaggcc tccttgagct gcggtgggct ccacccagtt caagcttccc 17520 agctgctttg tttacctact caagcctcag caatggtggg cacccctccc cccgcctcac 17580 tgccgccttg ctgttcaatc tcagactgct ggctagcaat aagtgaggct ccgtgggtac 17640 aggaccctct gagccaggca cgggatataa tctcctggtg tgccgtttgc aaagaccgtt 17700 ggaaaagtgc agtattaggg tgggagtgac ccaattttcc aggtgccatc tgtcacagct 17760 tcccttggct aggaaaggga attccctgac cccttgcact tcccaggtga ggcaatgcct 17820 cgccctgctt tggcttatgc tcggtgggct gcacccactg tcctgcaccc actgtctgac 17880 aagccccagt gagatgaacc tggtacctca gttggaaata cagaaatcac ctgtcttctg 17940 cgctgctcac cctgggagtt gtagaccgga gctgttccta ttcggccatc ttggaaccac 18000 cccaatttgc attttgctaa taataagtga tgttgagcat tttttcatat gctgttggcc 18060 atgtgtatgt cttcttttga agagtctatt ggtgtccttt gtccactttt tttttttttt 18120 ttttctgaga caaagtatca cactgtcatc caagctggaa tgcagtggtg agatctctgc 18180 tcattgcaac ctccacctca cgggctcaag ggattctcct gcctcaccct cccaagtagc 18240 tgggattaca ggtgtgcacc actacgccca gctaattttt gtatttttta gtagagatgg 18300 tctgttggcc aggctggtct cgaattcctc acctcaagtg atctgtccac ctcagtcccc 18360 aaagtggtag gattacaggc atgagccacc atgcctggcc atttgtccac tttttaatgg 18420 gattgttttt ggcttattaa tgtgtttaag ttccttatag attctggata ttccaacttt 18480 gtcagatggg tagtttgcaa atattttctc ccattctgtg ggttgtctat ttactttgtg 18540 ggtagtttct tttgctgtgc agaagttctc tagtttaatt aggtcccatt tgtcaatttt 18600 tgttttcatt gcaattgttt ttggcatctt cctcatgaaa tttttgccag gacctatgtt 18660 cagaatagta tttcctaagt tttcttcaag ggtttttaaa gttctaggtt ttacgtttaa 18720 gactttcatc catcttccat tgatctttgt atatagtatc aggaaggggt ccagtttcag 18780 tcttctgcat atggctagcc agttatctca gcatcacgta ttgaataggg aatgctttcc 18840 ccattgcttg tttttgtcaa cattatcaaa gatcagatgg ttgtagatat acagctttat 18900 ttctgggcat tttattctgt tccattggtc tatgtgtctg tttttgtacc agtagcatgc 18960 cgttttggtt actgttgcct tgtagtatag tttaaaaata ggtaatgtga tgcctccaga 19020 tttgttattt tttacttaga attgctttgg ctatttgggc tgttttttgg ttccatatta 19080 attttagaat ttctttctca ttatgtgaag aatgtcattg gttgcttgct aggaatagca 19140 ttgaatctgt aaattgcttg gagtagtatg gcgattttaa caatattgat tattcctatt 19200 catgagcatg gaatgttttc catttgtgtg tgtatcatct ctgatttctt tcagcagtgt 19260 tttgtagttc ttgctataga gatctttcac aatcctggtt agatgtattc ctaggtactt 19320 ttttgtggct attgtgaatg ggattgcatt cttgatttgg tgcttattgc acaatgttac 19380 ataatactta ctacccaata gaagaaattg attttgtatc ctgaaacttt gctgaagttg 19440 tttattagat ctagcagatt tgggacagag actaggaggt tttctaggta taaaatcata 19500 acatctgcaa acagagatac tttgacttcc tctcctccta tttggatgtc ttttacttct 19560 ttctcttacc tgattgctct gggtaggact tctagtacta tgttgaatag atggtgagaa 19620 tgagcatcct tgtcttgttc tggttctcaa ggagaatact tccagctttt gcccattcag 19680 tgtgatgttg gctgtggttt tgtcataaat ggcccttatt attttgaggt atgctccttc 19740 aatgtctagt ttgttgagag ttttaacatt gaaggatgtt gaattttatc aaatgctttt 19800 tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 19860 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nattttttgg aatagtttca 19920 gtagaaattg tacgagctct tctttatgtg tctggtagaa ttcagctgtg aatctctctg 19980 gtcctggcct ttttctggtt gttagaattt ttattactga ttcaatttta gaactcgtta 20040 ttgctctgtt cagagtttta atttcttcct ggttcagtcc taggaattta tttcttgtag 20100 gtttttctag tttctgtata tagaggtgtt cgtaatagtc tctgagggtt ttttatattt 20160 ccacggggtc aatggtaatg tcctctttgt catttctggt tgtgtttatt tggatcttct 20220 gtcttttttt tttctttatt agtctagcta gtggtctatc gatcttattt attcttccaa 20280 agaatgaact tttggttttg ttgatctttt gtatggtttt ttccatctca attttattca 20340 gttcagctct gattttggtt atttcttgtc ttctgctagc tttgggattg atcttgctct 20400 tgtttttcta attcttctat ttgtgatgtt aggttgctaa tttgagattt ttctaacttt 20460 ttgatatggg catttactgc tataaacttt ccactgcttt atctgtgtcc cagagaattg 20520 gcatattgta tctttgttct aattggtttc aaagaatttc tttatttctg ccttaatttc 20580 attgtttacc caaaagccac tcaggagcag gttgtttaat ttccatgaat tgtatggttc 20640 tgagctatct tcttagtatt gatttttttt aattgtgcag tggtccaaga gtgtggctgg 20700 tatgatttgg gttttttaaa atttgctgag aactttttat ggcagattgt gtggttgatt 20760 ttacagtata tgccatgtgc aaatgagaag aatatatatt ctgttgtttg gggtggagag 20820 ttctgtagat gtctcttagg cccatttggt caagtgtcga atttaggtcc caaatatctt 20880 tgttagtatt ctgccttgat gatctgttca atactgtcag tgggtgaagt ctcccattat 20940 tactatatgg ttatctaagt ctcttcatca gtctctaaga actcgtctta tgaatctggg 21000 tgctcctgta ttggatgcat atatattcct gacagttaag tcttcttgtt gaattgaacc 21060 ctttattatt atgtgatgcc tttctttgtc ttttttgatt gatgttgatt tgaagtctgt 21120 tttgtgtgaa attagaatgg caatccgtgc ttttgtttgt tttccatttg cttggtagat 21180 ttttccccat ccctttactt tgagcctatg aatgtcattg catgtgagat ggatctcttg 21240 aagacagtat ccagttgggc catgcttctt tattgaactt gccactctgt gccttttaat 21300 gggagcattt agcccattta catttggatt aatatcgata tgtgtgagtt taatcctgtc 21360 atcatgttgt tagctgatta ttttgcagac ttgattttgt agttgcttga tagtatcagt 21420 ggactgtgta ctttagtgtg ttttatggtg gccagtgatg gtcttttgtt tccatattta 21480 gcactccctt aagaacctct tgtaaggcag gtctggtggt aataaattcc cttagcattt 21540 gcttgtctga aaaggatctt atttctcctt cgcttataag gcttagtttg gctgtatatg 21600 aaattcttgg ttggagcaca gtcatagtgg taggtggcca cgggggtgct tgtgtcactt 21660 tctaatgagg ttaaaggtgg ctttaggtgg cttagaagag acagagactg tatgtttggc 21720 agaaagtaag gaaagagaac aagagtctct gcctggtaac ctagataatt caaccagatt 21780 ttgtctaaga ctattaaggt ggtacctcta tgactgtgca agaaccacag cattactggg 21840 cttagggtgc cctctaaagc agaaatgcct tagatcacaa aactcaagtc ctttcaaatc 21900 tggaaagcct ttccaaaaag gctgcctata aataagcctg gacagtgaag actacaataa 21960 atgctcaact cttcaatgcc cagacactga agaacatcta ctagcattaa caccatccag 22020 gaaaacatga cctcaccata aggcatcagg gaccaattct ggagaaagag agatatgtga 22080 cctttcagac agaattcaaa atagctgtgt tgaggaaatt caaagaaact caagataaca 22140 cagagaagga attcagaatt ctagcagata aatgtaataa agagattgaa ataattaaaa 22200 ataatcaaac agaaattctg gagctgaaaa aattcaattt gcatactgaa gaatgcatca 22260 gagtcctttc atagcagaat ggatccagca gaagaaagaa ttagtgagct tgaagacagg 22320 ctataattga aaatacacag aagggacaaa agaaaaaaga ataaaaaaaa caatgaaaca 22380 cacctgcagg atctagaaaa tagcctcaaa atggcaaatc taagagttat tggccttaag 22440 gaggaggtag tgaaagaaat aggagtagaa gttttattca aagggataag agaaaatttc 22500 ccaaacctag ggaaagatat ccaagtacaa gaaggtttta gaacaccgag tagatttaac 22560 ccaaagtaga ctacctcaag gcatttaata atcaaactcc caaaggtcaa ggataaagaa 22620 aggattttaa aagcagcaag agaaaagaaa caagtaacat actatggagt tccaatacat 22680 ctggccgcag acttttcatg gaaaccttac aggccaggag agagtggcat gacatattta 22740 aagttctgaa ggaaagaaaa cttttaccct aaatagtata tctggtgaaa atatctttca 22800 aacatgaagg agaaataaag attttcccaa acaaaagctg aaggatttca tcaataccaa 22860 acccatccta caagaaatgc taaaaggagt acttcaatca gaaagagaag aacattaatg 22920 agcaatcatc acctgaaggt aagaaactta ctcataatag taagtacacc gaaaaacaca 22980 gaatattttt acactgtaac tgtggtgtat aacctctctt accctaagta gacagactag 23040 atgatgaacc aatcaaaaat aataactaca acaacttttc aagacacagt cagtaaaata 23100 atatataaat agaaacaaca aaaagtttaa aaagctggga aatgaagtta agtttttatt 23160 agttttcttt ctgcctgttt gtttatttgt ttatccaaat agtgttgtta tcagattaaa 23220 ataatagatt ataagatggt atttgcaagc ctcatggtaa cctcaaacca aaaaacataa 23280 aatggataca caaaaaataa aaatcaagaa accaaattat atcaccagag aaaatcatct 23340 tcactagagg aagacaggaa taaaagaaag aagagaagat cacaaaacaa gcagaaaaca 23400 aataataaaa tggcaagagt aagtccttac ttatcaataa taagtaatgt aaatggatta 23460 aactatccaa tcaaaagaca tagactggct gaatgaatga aaaaacaaga accaatgatc 23520 tgttccctac aagaaacaca tttcacctat aaagacacac ataaacagaa aatgaagtga 23580 tggaaaaata tattacatgc cagtggaaaa caaaaaagag cagaagtcac tatacttgta 23640 tcagacaaaa tagatttcaa aacaaaaact atacaaagag agacaaagaa ggtcactata 23700 taatgataat ggagtcaatt cagcaagagg atataacaat tttaaatata tatgcaccca 23760 acacaggaac acccagatat gtaaaggaaa tattattaga gctaaagaga gagataggcc 23820 tcaatacaat aatagctgga gactttaaca ccccactttt agcattggac gtatcttcca 23880 gacagaaaat cagcaaagaa acatgaaact taatctgcat tacagaccaa atggatctaa 23940 tagatattta cagaacattc catccaagag ctgcagaata cacattcttt tcctcagcac 24000 atggatcatt cccaaggata caccatatat tagctcacaa aacaagtctt aaaacattca 24060 aaaaattgaa ataatatcaa gcatcttctc tgaccacaat gcaataaaat tagaaattaa 24120 taataagagg aatttagaaa actacataaa tacatgaaaa ttaaacaata tgctcctgag 24180 tgaccagtgg gtcaatgaag aaattaagaa caaaataaag aaagaaaaat ttcttgaaac 24240 aaatgatcat ggaaacacaa tataccaaaa cctatgggat atattaagag agaagtttat 24300 agctataagt gcctatatca aaaaagagga aaaacttcaa ataaacaatc taatgatgca 24360 tcttaaagag agaaaagcag gaacaaacca aacccactat tagtagatga aaagaaataa 24420 caaagatcag agcagaaata aatgaaatta aaaagaaaaa atacaaaaga tcaatgaaac 24480 aaaaagttga ttttttgaaa agttaaaatc gacaaacctt tagccaagct aagaaaaaaa 24540 gagagaagat ctaaataaat aaaatcagaa atgaaaaagg aaacatttca gctgatactg 24600 cagaaattca agggatcatt agtggctgct atgagcaatt atatgccaat aaattggaaa 24660 atctagaaga aatggacaaa tctctagata catacagcct accaagactg aactaggaag 24720 aaatccaaaa cctaaacaga ccaataacaa gtaacaacat ggaagccgta ataaaaagcc 24780 ttacagtgaa gaaaagccaa ggactggatg gcttcactgc tgaaattcta cccaacattt 24840 aagaagaact agtatcagtc ctactgaaac tactccgaaa aaatagagga ggaaggacta 24900 ttaccacact cattctctga ggccagtatt accttaatac caaaaccaga caaaaacaca 24960 tcaaaaaatt gactaaaaac ttaaatctaa gacctcaaac tatgaagcta ctacaagaaa 25020 acattgggga aaatctccag gacattggtc tgcaaagatt tcttgagcaa taacacacaa 25080 gcataggcaa ccaaagcaaa catggacaaa caagatcatg tcaagttaaa aagcttctgc 25140 acagtgaggg atacaataaa caaagtgaag agacaaccca cagaatggga gaaagtattt 25200 gcaaactacc cttcagacaa gggattaata accagaatat ataaggagct caaatgactc 25260 tataggaaaa aatctaaaat tcgatcaaaa aatagttaaa agattagaat agacgtttct 25320 cagaagaaga tatacaaatg gcaaacaggc atatgaaaag gcattcaaca tcactgatag 25380 tcagagaaat gcaaatgaaa actgcaatat cacctcaccc cagctaaaat ggcttttatc 25440 caaaagacag gcaagaacaa attcttgtga gggtgtggag aaaaggaaac tctcatacac 25500 tgttggtggg aacgtaaatt agtacaacca ctatggagaa cagtttggag tttcctcaaa 25560 aaactaaaaa ttggctgggc gcagtggctc atgcctgtaa tcccagcact ttgggaggcc 25620 gaggtgggca gatcatgagg tcaggagatt gagaccatcc tggctaacac agtgaaaccc 25680 catctctact aaaaacacaa aaaattagcc aggtgcagtg gcgggcacct gtagtcccag 25740 ctactcggga ggctgaggca gcagaatggt gtgaacccag gaggcggagc ttgcagtgag 25800 ccaagattgc gccactgcac tccagcctgg gcaacagagc aagactctgt ttaaaaaaaa 25860 aaaaaaaaaa aaaacactaa aaattgagca atctcactgc agggtataca tccaaaagaa 25920 aggaaatcag aatatcgaag agatacatgc acttctatgt ttgtggcagc actgtttata 25980 atagccaaga ttggaagcaa cccaagtgtc catcagcaga tgaacagata tagaaaacgt 26040 ggtgcatata catactattc agccatcaaa gagaatgagt cccagtcatt tgcaacaaac 26100 atggatgaaa ctggagatta cgttatgtga aataagccag gcacaggaag acaaacatca 26160 catgttctca cttatttgtg ggatctaaaa ctcaaatcaa ttgaactcat ggacatagag 26220 agcagaagga tggttaccag aagctgagaa aagtagtggg gggcaagagg gaaaggtggg 26280 gatagttagt gggtacaaaa aaatagaaag aatgaataag acccactgtt tgatagcaca 26340 atagggtgac tatagtcaat aataaattaa ttgtatactt tttaataaca aagaatatag 26400 ttggattgtt tgtaactcaa aggttaaatg tttgaggagc tgcatacccc attctccatg 26460 atgtgcttat tttacagtgc ttacttgtat caaaccatct catggatccc cataaataca 26520 tacacctact acatatccac aaaattttta aaaaataatt ttaaaatttt aaaaaagaaa 26580 agaaattatt ggctggaatt tcttttcttt aaggatgctg aatataggcc cccaatctct 26640 tctgccttgc agggcttctg ctgaaaagtt cactgttagc ctgatgaggt tcccttttta 26700 ggtgacctgc cccttctgtg tctagctgcc tttattgttt tttctttcat gttggccttg 26760 gagaatctga ggacttgggg aatctgagaa ctatgtgttt tagggatgat tgtcttgtat 26820 agtatctcac aaggattctt tgcatttcct gaatttaaat gttggcttct ctagggaggt 26880 tggggaaact tttgtggaca atatcttcaa atacgttttc caagttgctg gctttctcat 26940 gctctttcag ggatgccagc gagtcataga tttggtctct ttccaaaatc ccatattcct 27000 cagaggtgtt gttcatttgt tttcattctt ttttattttt gtctgactga gttaatttgg 27060 agaaccagtc ttcaggctct gagattcttt cctcagcttg gtctatcctg ttgttaatac 27120 ttgctattgt attatgaaat tcttgtagtg agatttccag ctctatcaga tcagtttggt 27180 tctttcctaa aatgaccatt tcgtctttca gctcctgtat cattttatca tattccttag 27240 attgcttgga ctggattttg actttctcct gaatctcaat gatcttcatt cctactcata 27300 ttctgaattc tatgtctgtc atttcagcca tttcagcctg ttaaaatacc attgctgagg 27360 cactagtgca cacatctgga gataggaaga cactgtggct ttttgagtta ccagagctct 27420 tgagctggtt ctttctcgtg tgtgtgggct gatgttcctt taatctttga agttgtcttc 27480 ctttggatgg ggtttttttt gttttgtttt tatcttgttt gaagcccttg ggggtttgat 27540 tgtcgtataa ggtgggttca gtcaactggc ttcgattctg gaagatttca ggtggccaag 27600 gctcagctca gcactcctga gctgcctgct ctaactctag aaggctggta ctgggtcccc 27660 agccttttcc tctggcctcc tggggttcag aacctgctgc attagagggg ccaaggtgtt 27720 cccagtccgc tggcctcaac agtccagtgg gggtgcaggc caaagtgctt ccttgtggca 27780 gtgccagcag gatctgtgct cactcacaca tggcaccagc cacagtggca tggcagggtg 27840 cgtgtgtgtt ggctggggca gggcactggt gggagccttt gccttagttt tcatggcact 27900 gaatactgca aaacattttg gtgttgtatt ttggggccgc catccagtag gcagtgctta 27960 agagtgatca ccaggagata cactcttact ctgctgcgtg gttcttcatg tttcagtaca 28020 gttggcagta gtgttctgtg tgtgttgggg acaggggggt gaccttctca cttagtccac 28080 tcctgggcct tagacaagac ccttctgatt actggctcca cacctgcatt tcttttgttg 28140 ggtgttctgg taaacggggt tccttcaggc aggggccgtg gttggcagac aggccgtatc 28200 cttgctgggt ccgccctgca gagagaggca tacctcctcc acctcactcc tccaccagcc 28260 tacaaactcg gacatctcac ctctgtcagt gataagagaa tgagggctct tccctgcttg 28320 ggcattgtcc aaaccagcaa gtcctgctca gctaggatct gcagaagtgg gtggggtcac 28380 gtaatcagct gtctgggtgc ttcccagagg aacaggggat ggcacccacc cacagagtag 28440 cgggactgct gggctggaag ctctagcagg tgtggcccat ctggctacca caggcagagg 28500 tggttggagt cacccaccct gccatctggg tgtttccctg ggcagtaaca ggaggctgca 28560 cccctcagcc ggattcagac agaattagga ccactgcgct ggaagctcta gcagacatgg 28620 cctgcatggc taggagaggc ggtggtgggt ggagtcaccc accctgcctt cagagtgttt 28680 ctcggggaac cgggagtttg cctctgctgg ctgagttcag acagaagcag actgctagca 28740 gacattgccc gcctggctac cagtggcagg gatgggcaga gtcaccccct ctgctgtcca 28800 agtggggaac ctgaacaaaa taatatccca aagaacagtt tcatggcaag tataaaaacc 28860 acatggggcc gggcgcggtg gctcacgcct gtaatcccag cactttggga ggccgaggcg 28920 ggtggatcat gaggtcagga gatcgagacc atcctggcta acaaggtgaa accccgtctc 28980 tactaaaaat acaaaaaatt agccgggcgc ggtggcgggc gcctgtagtc ccagctactc 29040 gggaggctga ggcaggagaa tggcgtgaac ccgggaagcg gagcttgcag tgagctgaga 29100 ttgcgccact gcagtccgca gtccggcctg ggcgacagag cgagactccg tctcatataa 29160 actgatcacc tgatgagcat accacatggc atgctttttg atctatcaag agcaaaggta 29220 ggaaaacttg ggtgagtaaa agacaaaata gtaggctggg tgcagtggcc catgcctgta 29280 atcccagcac tttgggaggc tgaggcgggc agatcacgtg aggtcaggag tttgagacca 29340 gcatggccaa catggtgaaa ccccatctct acaaaaatac aaaaattagc tggcgtgatg 29400 gcgggtactt ataatcccag ctactctggg ggctgaggca ggagaatcgc ttgaccctgg 29460 gaggcggagg ttgcggtgag ccgagatcgt gccactgcac tccagcctgg gtgacaagag 29520 agaaactctg cctcaaaaaa ataaataaaa taaaatgaaa ttcacctaag gagagaaggg 29580 agaggcactg aacatattgt gaagaaaaga acagaagagg ttgtaactgt tgttctattt 29640 ggaatctgac taagatagga gcctgaggtc cctccctggg aaacaatggt gtatataacc 29700 tcagcttctc acaggtcatg aggtatcttc catcagtctc aggtcgggct cgcccggaag 29760 cctgagaccg gatttggttg caggtggctt actggggagg tgatttcttg gactcatcaa 29820 caagagagaa tggtgaggcg gggaaaccca ggaaaggtag attatggaac aggttacctt 29880 gtgagcaact gaggctccat gctgctgcgg ttgtctggga gacggtgaga acatgcctga 29940 gtgttatctg tcctgggggc aagggacccc acctgtcatc cactaagggc acctcctgga 30000 ctggagaact atcctccagt atttctggcc cacccggctt acaggcagag aaagagccct 30060 cagacagatg accacagctg ttgccgaggg acaccattgg caggtactgc aatagcaggt 30120 gcttgggatg ctgtcagcat ctgctacaca ctccagtgtc tcctccagtc ctccatgacc 30180 tacaaccccc agcttacaga tgagaaaact gaggctcata ggggctaatg gccagcatca 30240 taagcttagt gaggactgag agttgtgtct gttttgttca ctctttatcc ccaaccccta 30300 tagcagcacc tggcacagag taagcccttg ataaatattc agtgaaggaa agtcattcag 30360 ggagtgtgac ttagttgagt caagacttga actgggatat tctaccccaa agcacatgtc 30420 cttccgccgc atggcaagaa cataccagtg tctgcagaca gcgcagtgtg tggcttggag 30480 taggtgccta aaacagttgt ttaattaatt aattaattct tctattctct acctttatgt 30540 tccttctttc cctcccttac ttagaaatca gagaaagaga gagagagaaa tcagagaaag 30600 gcaggccccc atgggcagcg gtggtaggaa cgcccccaca ggcgcacacc agcctacagc 30660 cccaccacca cccatgggag ccttctgtga gagagagcct ctgttccaca tggccctgga 30720 aaaatgagga ttttaaccaa aaattcacac agctgctact tctacaaaga cctcacccca 30780 gaagccaaga tcccctggtc aagagaagct ggcctgatta tgtggaggag aatcgaggac 30840 atttaattga gatcagagac ttatttggcc caggcctgga tacccaagaa cctcgcatag 30900 tcatactgca gggggctgct ggaattggga agtcaacact ggccaggcag gtgaaggaag 30960 cctgggggag aggccagctg tatggggacc gcttccagca tgtcttctac ttcagctgca 31020 gagagctggc ccagtccaag gtggtgagtc tcgctgagct catcggaaaa gatgggacag 31080 ccactccggc tcccattaga cagatcctgt ctaggccaga gcggctgctc ttcatcctcg 31140 atggtgtaga tgagccagga tgggtcttgc aggagccgag ttctgagctc tgtctgcact 31200 ggagccagcc acagccggcg gatgcactgc tgggcagttt gctggggaaa actatacttc 31260 ccgaggcatc cttcctgatc acggctcgga ccacagctct gcagaacctc attccttctt 31320 tggagcaggc acgttgggta gaggtcctgg ggttctctga gtccagcagg aaggaatatt 31380 tctacagata tttcacagat gaaaggcaag caattagagc ctttaggttg gtcaaatcaa 31440 acaaagagct ctgggccctg tgtcttgtgc cctgggtgtc ctggctggcc tgcacttgcc 31500 tgatgcagca gatgaagcgg aaggaaaaac tcacactgac ttccaagacc accacaaccc 31560 tctgtctaca ttaccttgcc caggctctcc aagctcagcc attgggaccc cagctcagag 31620 acctctgctc tctggctgct gagggcatct ggcaaaaaaa gacccttttc agtccagatg 31680 acctcaggaa gcatgggtta gatggggcca tcatctccac cttcttgaag atgggtattc 31740 ttcaagagca ccccatccct ctgagctaca gcttcattca cctctgtttc caagagttct 31800 ttgcagcaat gtcctatgtc ttggaggatg agaaggggag aggtaaacat tctaattgca 31860 tcatagattt ggaaaagacg ctagaagcat atggaataca tggcctgttt ggggcatcaa 31920 ccacacgttt cctattgggc ctgttaagtg atgaggggga gagagagatg gagaacatct 31980 ttcactgccg gctgtctcag gggaggaacc tgatgcagtg ggtcccgtcc ctgcagctgc 32040 tgctgcagcc acactctctg gagtccctcc actgcttgta cgagactcgg aacaaaacgt 32100 tcctgacaca agtgatggcc catttcgaag aaatgggcat gtgtgtagaa acagacatgg 32160 agctcttagt gtgcactttc tgcattaaat tcagccgcca cgtgaagaag cttcagctga 32220 ttgagggcag gcagcacaga tcaacatgga gccccaccat ggtagtcctg tgagtaccca 32280 aaccacccat gtctgcagct cctctgtcag tccataagcc cgagtgacct gagcaccaga 32340 ggcatgctca ctgagcctgg ccgggggtgg gggtggctaa aggatcaggg tctccagtga 32400 cttctaactg cccaatctga ccaccatgct ctgtctacct aagctcctct gcagtgcccg 32460 gctgtggggc aggccctgct tgcttctgga gaggccactg ggccctgccg tgtctgccac 32520 ctctcctcct acccttagaa tgaccgtcca tcttctctcc tggacgtgct cccaccttcc 32580 tgctgacctt agctaccccc acactcccca aggatcaggc agccccccta cccctgctcc 32640 tctctctggg cattcattct ctcggagaaa ttgctcctcc agctgcttca ggaacccttt 32700 ccatgacaac gacattggcc ctgagatctt ccccatcagc tctcaaaaag agtttcctat 32760 ttttccctgc aaaactgact tcataattat ttgtgaccgt ggtgccacca ttgccccacg 32820 cactcaacct gaaacccact ctctggctgt gcaagaatag cttcctccaa ccattctcca 32880 gcccggcagt actgagaact tccccatccc ggttccccga cctctgtcgt ggtatcgatc 32940 acatgctatt ggtatgtgtc tatgtcctgc tttagatgat tcattccata agtatttatt 33000 aagcactgat tctatgccag gccctacaga tcacagatta aaaagcacaa tacgtgcctt 33060 aaggagctca aaaaataggg gagaagacaa acattcattt aatgacaaca gagtgtgatg 33120 tctacaacag tagaggtatg gacaaggtac tgtggtggcc caaggaaggg agtgatcaca 33180 cgagtgggta ataaagattc tcagaggaag aaatgtttga tgtgggcctt catatttata 33240 tttttaaaac tttttctcac aataaaattc atgttcatta taggaaaaaa aaaaagatat 33300 gtaggctggg tgaagtagct cacacctgta attccagtac tttgggaggc tgaagcagga 33360 ggatcacttg aggccaggag tcgcgagtcc agcctggcca acatggcaaa accccatctc 33420 taccaaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 33480 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaca tgtctatggc 33540 gaaatcgaac catattgtac attttatttt atagtcagtt ttattttctt tatatatcaa 33600 ggctatcttc ttgtgagtgg gtgtggggta tttttttttt tttttggttt gttgttgtct 33660 gagacggagt ctcgctctgt cccctaggct ggagtgcagt ggtgcaatca cagctcactg 33720 cagcctggac cacccgagtt taagggatcc tcttacctca gccccgctag gagctgagac 33780 tacaggagca tgccaacatg cagcgataat ttttgtatat tttttgtata gagatgagat 33840 ttcactgtgt tgaccaagct ggtctcaaac tcctgggctc aagcaagctg ctcaccttga 33900 cctcccaaag tgctgggatc acagatgtga gccaccatgc ccagcccttc atgtgtgtgt 33960 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtaa agagagagac agagacagag 34020 acagacagag tcttgctctg tcaccaggct ggcatgcggt ggcacgatct ctgctcactg 34080 caacctctgc ctcccaggtt caagcaattc tcatgcctca gcctccagag tagctgggat 34140 tacaggcacg caccaccaca cccagctaat tttttgtcgt cttagtagaa acggggtttc 34200 accatgttgg ccaggttggt cttgaactcc tgacctcaaa tgatccttct gcctcagcct 34260 cccaaagtgc tgggatcaca ggcatgagcc accatgccag ccctttatat atttttaagt 34320 agtctgatac aacaccattt tgaatgacca catgctattc catagggaaa gtgccatggc 34380 tgatttgacc attctcctgt tcttggccat tttaagctgc ttctagttgt tcatagttat 34440 aaatgtgagg taggctttaa aggacctgga tacaggtact gcaagtagca gtggacaggt 34500 gggaaatctt caaagagttc caagcagcta gaggcagaga ccagatggtg gctggccggg 34560 ggatgggggc tgggataggc aaagggaacc aagaacacca ggccagagag gagagtttgg 34620 gatgggcggg agcaagtgga gccttaagag aatgccgtgg gaaagtcaga ggcttcagcc 34680 acaagtggga ggagaaattt tcaaggaact gaatcagcta gacaaggcag agagaagaat 34740 aaggaagcta ctataaaaaa aacaagaaag ttcaggaaga aagggtggag gttggccaag 34800 ggctgtgtga aggaagagga gaaccaacca gagatattaa agaaatgaag atgacaggat 34860 gggagcttcc aggtgtttgg catgtgatta agcattgtgc attttcaagt ataaattttt 34920 aatttttttt cagtcacctt gattaaggat ggcaaatgcc aacggaaata aaaaatttaa 34980 aatgtatttt tcaagtttca caaaactaag taagtgtaaa agaaattcaa gtaagctttc 35040 atttatggct ttttttcgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgatgg 35100 gatctcactg tgtctcccag gaatagtgca gtggcatgac catacctcac tgcaaccttg 35160 gcctcccagg ttcaagccat tctcctgcct cagcctcctg agtagctggg actacaggtg 35220 cacaccacca cattgcctaa ttttttttat tttttgtaga aacaggatct tactatgttg 35280 cccaggctgg tcttaaactc ctgggctcaa tcgatcctcc tgcctctacc tcctcaagta 35340 ctgggattac aggctagagc cactgcactc agctggcttt ttttatatgt ttgtagtact 35400 tgtactttca cagttattaa aacttaaaac tgctctagga cttttgggat actctttagt 35460 agcttttaat tctcacatac ccctctgcag caggtgctat tgctgtcctc tttttgcagc 35520 ggaggagacg gagtgaggga tagagaagca gagcccacat gcaggcagct ccttgcctct 35580 gcttcttgcc cccacatggc tctgcagact taggggaaaa gatgaattca ggttggggac 35640 ttgttgggac aaattagatc aaagaattac tgggatgtca aggtggggat atccaaccag 35700 aaaatagaaa actcagtctg aaacttggaa tagagatttg agtagttgaa gccttggagg 35760 tagatgaggt cactaaagta gtgtgtgaaa acagattaaa tgggtggaaa ttccatgagt 35820 cggagtctac tctaacaccc tcagacaaag cccgtcatct cttcccctca ggctgcaaat 35880 attctctcct agggcactcc ccacagccat ccctctcctg cagaccctct tgttatcctg 35940 ggaaggggat agggtcccat gtgagccaac ctggacagca gtgcatatga tcatatacag 36000 tcttccatca caacatcctc aggccatctc tcccaggctc tgctactgcc tcccgtcttc 36060 tttcctgggc actgggactc cagaatgtct gtgccaaagg aacacaggtg tgtctgcatc 36120 aaagggagga aggctgtcaa atttttttta aattaaaatt gtcaaatttt tttaaaaaat 36180 ttttttctta tctgcagtgc tcagcaccga ggccaggccc tatgctgagc agtgcagatt 36240 taaaaaaaaa aaaaaaaaga caaatgtctg tccctgccct caaggaactc ccagactgag 36300 gcagaaacag acacacaagg tgaacacttc agtgtgctat ggtaagggct gtagaatgag 36360 gagccacatg gaggagaggg acctagccct gggtgacggg cagacaggca gggagggctg 36420 gtcagtactg gagatggcac tacccgctgt ggcctggtgc tcagggggac aaagcccaac 36480 tgtgttgtgc agggctgggc acctcaaacc cctgccagac acatcacatc tctcctgtcc 36540 ttctctctcc ccctcaggtt caggtgggtc ccagtcacag atgcctattg gcagattctc 36600 ttctccgtcc tcaaggtcac cagaaacctg aaggagctgg acctaagtgg aaactcgctg 36660 agccactctg cagtgaagag tctttgtaag accctgagac gccctcgctg cctcctggag 36720 accctgcggt gagtctggcc tgggttctgt tctgaagcag ggatggggaa gagatccaag 36780 gcagaagctg agtctgggct gagggatttg tctttgtatt tatcaaatat tcgttgaggc 36840 cttctctggc cgggatgact ttagtaaaca aacacagata ggattttttg ctcataaaaa 36900 aaaaaaattc cttcatgaag tgaggtttga gggaggtctg aagaatacag aggatttaac 36960 taagcaaagc aggaaggaaa aagttccagg aaaagagaat agcatgtaca aaggctctgt 37020 ggagaaggca ggaggaggca gtcaagaaac tgaaagcagg ccagtgtggc tggagggcag 37080 atagcaaagg gttgagataa ggctacagat accaacagga agtagactat gccaggcttg 37140 tgcaccatgt tggatttggg tcttttcctg atggtgtgga attttaagca gaaggttgac 37200 aagatcagta cattttcaaa atagaaactg gctgcaacat gaaaaatgga ttggaagagt 37260 gcaaaagtag aggctgggaa accaattagt aagaaaattg cagtagtccc tgcaaagatg 37320 gtaatagttt ggactaagga aaattatagt acttttaaat gtcacagata gtcatagctt 37380 ggactatggt attggcaatg gatatggaac aaagtggcta gatgtaagag agctataaaa 37440 gtaaaatcaa ctgggctcag taatggattg aatttaggaa aaaaaggaag aggaagatat 37500 caaggatgat tttaggtttc tgaccaaagc aactcatggg atagtggcat cttcatcaag 37560 atggggaata ctaaaagaaa agtgattggg gggtggaatg tcatgagtgt gatctcgtgc 37620 ttgctggttt caagatcctt tcaaggtatc caactggacc tgccaaatag gtttcaaata 37680 aagctctatc caaagataag atcgccccat aaaacaaatc tcaacaaatc tgcaaggact 37740 gaaatcatac aaagtatatt ctctgaccat gataaaatga aattagaaat caacaataga 37800 aggaaagttg aggagttcat aaatacactg aaatttaaaa acacactcct aaataaccaa 37860 gggtcaaaga agaaatcaca aaggaaatta gaaaacactt tgagagggat gaaaacaaaa 37920 tcacaacatg ccaaaatgta ttggatgcag ctaatgcagt gcttaaaggg aagtttatag 37980 ttataaatac ttctatttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag gaaagataga 38040 gccagacatg atggcccatg cctgtaatcc caaatctttg gaaggccaag gcaggacaat 38100 tgccttaggc caggagtttg agaccagcct gggcaacata gagagactct gtctctacaa 38160 aaacataaaa taaaaaatta gttgagaatg atgatgtgtg cctgttgtca taactactta 38220 ggaggctgag acaggagtct cacttgagcc caggagttca acactgcagt gagctgtgat 38280 tatgccattg cacttcaagc tgggcaacag agacagacct tatctctaaa aaaattactc 38340 aaaatttttt taattaatga aaaaaagaaa agataccact acatacttat cagaatggcc 38400 aaaatccaaa acactgacaa caccaaatgc tgacgaggat gtggaaaaac aggaactctc 38460 attcattgtt ggtgggaatg caaaatgaga cagccctttg gaatacagtc tagcagtttt 38520 ctacaaaaca aaatatactt ttaccatagg atccagcaat cgcattcctt gatatttacc 38580 aaaatgagtt gaatacctct gtccacacaa aaacctgcac atgaaatgtt atggcagctt 38640 tgttcatcat tgccaaaact tggaagcaag ctgcccttca gaaagtgaat ggactcataa 38700 attgtggtac aaacagacaa tggactatta ttcaatgcta aaaagaaata agctatcaag 38760 ctacacaaag atatggagaa actttaaatg catgttacta aaagaagcca atctaaaaag 38820 gctacatgct gtatgcttcc aactatatga cattctggaa gaggcaaaaa caaggagata 38880 gtaaaaaaga ttagtggttt ccagggcttg ggggaaggga ggaattaata gagcacagta 38940 gagttttagg gcagtgaaac tattctatat actgtaatcg ttcatacata ttattttaca 39000 gcaatcaaaa tccatagaat gtacaccacc aagagtgacc cctaatgtaa actatggact 39060 ttgggtgatg atgtgtccat gtcagtttat caattgtaac aaatgtcacc acaatatact 39120 agtgtggtac agcatgtcag cagtgggaaa gattgtatga gggatgtggg gacagggagt 39180 atatgggaac tcgctgtact ttttgctcaa ttttactgtg aacctaaaac tactctaaaa 39240 aataaagtct attgaagaag aagaaaaaga agaagcagag aaatctcaaa taagtaacct 39300 aaacttacac cttaagaatt tgttttcctt aggtctttca tgttagtaaa gaaatttttt 39360 taaaaagcaa acaaaaccaa aagcaagcag ggggaaagaa ataataaaga ctagagtaga 39420 gataaatgaa acagagaata gaaaacaata cagaaaacca atgaaactaa aagttagttc 39480 tttaaaaata ccaacaaaat ttgacacctt ttagctagac taactaaaat gtaagaagat 39540 tcagattact aaaatcagaa ataaaagatg gggtatcaca attgaactta cagaaataaa 39600 aggattataa gtgaatatta tgaaatattg taagccaaca aattagataa cccagatgaa 39660 atgagccaac ttccagaaag acacgaatac ctaaactgac ttaataagaa atacaaaatc 39720 ttaatagact tataacaaat agagatattg aattagtaag ttaaaaactt tctatgaaac 39780 aaagcccagg cccagatggc tttactagtg aattctacca aatatttaaa gaagaattaa 39840 cacttatatt ttcaaactct tccaaaaata gcagaaaaga gaacacgtgc caatttgttc 39900 tatgagacca atattaccct gataccaaaa ttagactaag ataaccaaaa gcatcacaag 39960 agaaaaaaaa aaatctgcag gccaacatcc tttatgaata tatatgcaaa attacataac 40020 aaaatactgg caacatataa aaaggattgt agaccatgac caaatgggat ttatcccaag 40080 aatgcaaatt gggtttaacc taggataatt gatcaatgta atgcaagatg ttaatagaat 40140 aaagaacaca aaccaatgat tatctcaaag acacaaaaaa agcatttgac aaaccctcaa 40200 caccttttca cattggaaac actcagcaaa ctagaaataa aagggaactt tatcaacctg 40260 ataaagttgt taccagtaaa aaaacccaca gctaacatta tacttaatgg tggatgactg 40320 agtgctttcc ccctaagatt aggaataaga caaagatgtc tgttcttgcc atttctattc 40380 aacactgtac tcaaggttcc atccagagta attaagcaag aaaatgaaat taacagcttc 40440 cagattgaaa agaaagtaat taaactatta tttgcaggtg acatgacatt gtatgtagaa 40500 aatcctaagg aatctacaaa tattaaaatt agggccaggt gtggtggctc atgccctgta 40560 atcccagcac tttgggaggc tgaggtgggt ggattgcttg agttccggag ttcaagacca 40620 gccagggcaa cacggcaaaa ccctgtctct accaaaacta caaaatatta gccgggcgtg 40680 gtggcacacg cccacagtcc cagctactca ggaggctgag gtgggagaat cgcttcagcc 40740 tggtaggcag aggtagcagt gagctgagat tgtgccactg cactccaacc tgggtgacaa 40800 agggagaccc tgttgaaaaa gaaaaaaaaa aaaaactaat gatttcagca aggtttcagg 40860 aaacaagatc aatatacaaa aattagttgt gctgctatac actagcaatg aacaatccaa 40920 aaattaaatt aggaaaacaa taaatttaac aaaagaaaac ttgtatacag agaactacaa 40980 aaaatggttg aaagaaatta tgtaagacct aaacaaatga aaagatgtcc catgttcata 41040 gattggaaga cttgatatta ttaaaatgac aatacccccc aaactgatct atagattcaa 41100 cacaatccct agcagaatcc cagctgccat ttttgcagaa atggacaagc tgattctaaa 41160 attcacgggc aaaaataaga ttgcatgtca agcctgggga acacccccac cccagtattt 41220 aatggccaaa gaagagccaa caaacaagcc tgagaaggag gggccacata gatgaaaggg 41280 acaccaggag agagaggtgt catggaaccc aacagagcag agggtctcaa ggaggtcagc 41340 agggactact gctgctggga ggtcaagata aggactgaaa tttgtccatt ggattagtga 41400 taggaaagtc atgaataacc tcagcagaag taccttatgt ggcatgaaga ggcgagaatg 41460 cagactggag tgcattaaag agggagaagg aggtaaaaaa tgagaccaca aatatacttt 41520 cttcaaagac tttgatagtg aaaaggagga aagcactaaa gcagagggga tggtgggctg 41580 agtaaatgtg tgtgtctgtc catctgctta cttgcattta agaaggaaaa gatctcagtt 41640 tgagaatcag tgaggaggag tgataaaaga cacaagaagg agaaggagtc attgatggct 41700 ctctatcctc agagggagga gggaggagaa agattggcct tggaagaggg aggaaatgtc 41760 tcctttactg taacaggagg ggagatgagg acagggaaga gagaggtagg aacataggtt 41820 tcactctgag aagctgagag gttcccatct gatggcttca tttttctctg agaaatagga 41880 attgagttgt ttgctaggag tgagtgtgag ggttgaggtg ttggatgttt ggaaaaggca 41940 agatcttcaa atggccatag tagagtgtga gatctctgcc agtcaggcat tgctgtggcc 42000 catgtgaggg tcgtggtcat gattcaagat ggcagccacc taccctgttg cacaggtggg 42060 agtttgggtg gagagtggaa agatagcaga attcagggtt agccagagga atgtgatgga 42120 aggtcagagg gtaagaaatt caggatactg ccaatgaagg atcatggaat ctaacctaca 42180 ttccaaggaa attgaagaaa agaaagtgat aagagattga aagaaaggag attgtaagaa 42240 aggaaagcaa tcaagagatt gttcattaaa aaggagttac agtgggagaa gttgagcagg 42300 cagagtaagg aggcaagccc agcctcgtgc tgtgactttc tgtatcctta tgaggctgct 42360 ctacccactc ctaagaccta tccaaggaga accaccagag tccagtaaaa atgcttttga 42420 gcaggactaa ggaatcctgg ttctacctca aacttcacac cccagtagga agacaccaat 42480 cttcaatgta ctctttttac attggcataa aatttcacta tctcaaaccc aaacaactcc 42540 atatctgtta attccttggg ccttccaaaa aacaccctgc aaaaatggca gagcagggac 42600 aactatttcc attttccata tataaagaaa tgggctgggg aaggacattc cctggagcca 42660 gacacacaca ggttcaaatc ttgcctctaa cggttacaga ttgaaaacta tgggtaggtt 42720 tctttagcaa gtttcagttt actcattaat aacatggatt caccatgtct gttgcacaag 42780 gtggctgcaa agtcaaagga gatcatgtat gctatgtact tagggtcatg tgtgctatgt 42840 atttagggtt atgtatgctg tgtacttagg acatgtatgc tatgtactta gggtcatgta 42900 tgctatgtat ttagggtcat gtatgctatg tacttaggac catgtatgct atgtacttag 42960 gttcatgtat gatgtgtatt tagggtcatg tatgctatgt atttagtgcc taagttgtcc 43020 taagaggctc agagaaatta acgacttgcc aaattcacat agttagcagt gggagagctg 43080 gacacaactc agtgagggca agacccagca ccaacaccta cctcctctgt atcaggtaag 43140 ccggggcctg attaattctg gctcactggg ggtagcacca taaaaagaaa gaaaatctgg 43200 atcaggcaaa tgtgaatatt tcccacaata aaaacatctc catatggaag agtgtacagc 43260 cttacacacg cttggatgcc agactaccca agacccgcaa gcaaagtcct aactattgta 43320 tttctctttt tttatttacc tacatctcct catttcctcc ccctcctcgc ccctgataac 43380 caccatttat tctttctttg tatgtatttt gcttttttaa tatttcaccc acaagtgaga 43440 ccatgtagtg tttttctttc tatgtctggc ttatttcact tttggattta tcttgaaatg 43500 cattgaataa gctggtgtca ggaattctgg ggttttgtca ctaatggcct gtaaaattga 43560 gcacattacg cagacaaagg agaaattggc gggagtgggg ttggagagct caaaaaagga 43620 atgccggaaa gagcaaggtc aggggagaga ccagacagag ggtgtggaat cgaagacaca 43680 tgagggtgga cccttggcct cctctggaac agaagagaaa ggactatgtg atgacacagg 43740 cagctgagga cggagaggtg gagggacatt gagggttttg gtttccacag cagcagcatc 43800 ctttaagcac taaccttctg ccaggggctt ccctaagaac atggcataca tgatctcctt 43860 ggatcttgca gccagctggt acaatagaca tggcagatcc atgatactga tgagtgaact 43920 gaaacttggt aaagacatct gccatagtct tcaacttgta actgacagag gcaagatttg 43980 aatccatctc tgtctagccc caaagcctgt gcccttcacc tgcccactgg ggtcactgct 44040 gcccaaggtc agatggcctc agcctcctca cagaggtcca ataccaagtg tcttcactgg 44100 ggcttgggag tgggtaggat tggaggaggg aagtccacac agcacagctt gtaatggtaa 44160 gttgtttaaa aataaataaa gccccaagag aggcacattt gtttatgctg agatgagtcc 44220 tgggggcagg aacatgccta gtgaaaatga gcttgtcaac aagcaggaaa tcagataagt 44280 ttgttgggta gccccccccc cccagggaga gagtgggctg nnnnnnnnnn nnnnnnnnnn 44340 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 44400 nnnnnnnnnn nnnnnnnnnn ggagtgtctg acggcagcaa cgagacgcag tagagagcgg 44460 cgagaagcga gagagtcaga cgagcagtga agtctgtgat gacgatagag cacgcagtac 44520 aacagcgcag tcgcgcagag cggacagaga gtgacatcta agtggataga cggtgtacta 44580 gagacgagct gactagacga gagagcatga gtaatcatcg cagacagaca ggacgaagta 44640 gagtaggtct ggagaagcac aacgcgaata ggcgtagcaa gacgagagcg atgatatatg 44700 cgcagaggag tgagatctgg acagcagaac ggagatagag agcgacgagc aagagagtga 44760 acgcactagc ggaacacgat acgagtacga aaagaaggag cggacgccac agaaggagca 44820 cgagtacgca gacgagctag tgtcgcgacg ggtgaggtca aagacatctg aagtcaagtg 44880 tatacgtagt atagcgcaag gcgacgagtc gaggagcgaa gtctgacgtc gaagacagcg 44940 gcgtagagca catatggcga cgcgatagtg agagagttag atcacgcgag aagaacgtca 45000 cgcgtgcagt gagaacgcaa gagtaagagt aacgaatgtg cacgagagag aagtgcaaag 45060 caaaaacgag tagtcgtgca gaacgagaga tacgacgacg gttcctaggg agcggagaaa 45120 tggacgcatg gcgatgcact agcgagggta cgtagagacg acgacaacga tagtacgtgc 45180 atgagtgccg gcacatgcga gccgagctca tgtaaccacg cacgatgcgg gagttagact 45240 cataggcgag taaaaganga gaggagagtg cggagtagca gatacgagga gaggagcgat 45300 cagagctata attgagattg cgcaatagaa gaagaggagg agagatagac gtgttagaca 45360 gtagtagaga gacataggga ggggatttat ttccctagag agaaagagag aagggagggc 45420 tatacaccga gagaacaaaa ttggattcaa ggttgagaga gaagacgtgt aattcggagt 45480 agataccacg cagaaccaag tgattgtgtg gataaaagcg gaaaaaggga tgggacaaac 45540 cctcaacacc ttttcacatt ggaaacactc agcaaactag aaataaaagg gaactttatc 45600 aacctgataa agttgttacc agtaaaaaaa cccacagcta acattatact taatggtgga 45660 tgactgagtg ctttccccct aagattagga ataagacaaa gatgtctgtt cttgccattt 45720 ctattcaaca ctgtactcaa ggttccatcc agagtaatta agcaagaaaa tgaaattaac 45780 agcttccaga ttgaaaagaa agtaattaaa ctattatttg caggtgacat gacattgtat 45840 gtagaaaatc ctaaggaatc tacaaatatt aaaattaggg ccaggtgtgg tggctcatgc 45900 cctgtaatcc cagcactttg ggaggctgag gtgggtggat tgcttgagtt ccggagttca 45960 agaccagcca gggcaacacg gcaaaaccct gtctctacca aaactacaaa atattagccg 46020 ggcgtggtgg cacacgccca cagtcccagc tactcaggag gctgaggtgg gagaatcgct 46080 tcagcctggt aggcagaggt agcagtgagc tgagattgta ccactgcact ccaacctggg 46140 tgacaaaggg agaccctgtt gaaaaagaaa aaaaaaaaaa actaatgatt tcagcaaggt 46200 ttcaggaaac aagatcaata tacaaaaatt agttgtgctg ctatacacta gcaatgaaca 46260 atccaaaaat taaattagga aaacaataaa tttaacaaaa gaaaacttgt atacagagaa 46320 ctacaaaaaa tggttgaaag aaattatgta agacctaaac aaatgaaaag atgtcccatg 46380 ttcatagatt ggaagacttg atattattaa aatgacaata ccccccaaac tgatctatag 46440 attcaacaca atccctagca gaatcccagc tgccattttt gcagaaatgg acaagctgat 46500 tctaaaattc acgggcaaaa ataagattgc atgtcaagcc tggggaacac ccccacccca 46560 gtatttaatg gccaaagaag agccaacaaa caagcctgag aaggaggggc cacatagatg 46620 aaagggacac caggagagag aggtgtcatg gaacccaaca gagcagaggg tctcaaggag 46680 gtcagcaggg actactgctg ctgggaggtc aagataagga ctgaaatttg tccattggat 46740 tagtgatagg aaagtcatga ataacctcag cagaagtacc ttatgtggca tgaagaggtg 46800 agaatgcaga ctggagtgca ttaaagaggg agaaggaggt aaaaaatgag accacaaata 46860 tactttcttc aaagactttg atagtgaaaa ggaggaaagc actaaagcag aggggatggt 46920 gggctgagta aatgtgtgtg tctgtccatc tgcttacttg catttaagaa ggaaaagatc 46980 tcagtttgag aatcagtgag gaggagtgat aaaagacaca agaaggagaa ggagtcattg 47040 atggctctct atcctcagag ggaggaggga ggagaaagat tggccttgga agagggagga 47100 aatgtctcct ttactgtaac aggaggggag atgaggacag ggaagagaga ggtaggaaca 47160 taggtttcac tctgagaagc tgagaggttc ccatctgatg gcttcatttt tctctgagaa 47220 ataggaattg agttgtttgc taggagtgag tgtgagggtt gaggtgttgg atgtttggaa 47280 aaggcaaaat cttcaaatgg ccatagtaga gtgtgagatc tctgccagtc aggcattgct 47340 gtggcccatg tgagggtcgt ggtcatgatt caagatggca gccacctacc ctgttgcaca 47400 ggtgggagtt tgggtggaga gtggaaagat agcagaattc agggttagcc agaggaatgt 47460 gatggaaggt cagagggtaa gaaattcagg atactgccaa tgaaggatca tggaatctaa 47520 cctacattcc aaggaaattg aagaaaagaa agtgataaga gattgaaaga aaggagattg 47580 taagaaagga aagcaatcaa gagattgttc attaaagagg agttacagtg ggagaagttg 47640 agcaggcaga gtaaaggagg caagcccagc ctcgtgctgt gactttctgt atccttatga 47700 ggctgctcta cccactccta agacctatcc aaggagaacc accagagtcc agtaaaaatg 47760 cttttgagca ggactaagga atcctggttc tacctcaaac ttcacacccc agtaggaaga 47820 caccaatctt caatgtactc tttttacatt ggcataaaat ttcactatct caaacccaaa 47880 caactccata tctgttaatt ccttgggcct tccaaaaaac accctgcaaa aatggcagag 47940 cagggacaac tatttccatt ttccatatat aaagaaatgg gctggggaag gacattccct 48000 ggagccagac acacacaggt tcaaatcttg cctctaacgg ttacagattg aaaactatgg 48060 gtaggtttct ttagcaagtt tcagtttact cattaataac atggattcac catgtctgtt 48120 gcacaaggtg gctgcaaagt caaaggagat catgtatgct atgtacttag ggtcatgtgt 48180 gctatgtatt tagggttatg tatgctgtgt acttaggaca tgtatgctat gtacttaggg 48240 tcatgtatgc tatgtattta gggtcatgta tgctatgtac ttaggaccat gtatgctatg 48300 tacttaggtt catgtatgat gtgtatttag ggtcatgtat gctatgtatt tagtgcctaa 48360 gttgtcctaa gaggctcaga gaaattaacg acttgccaaa ttcacatagt tagcagtggg 48420 agagctggac acaactcagt gagggcaaga cccagcacca acacctacct cctctgtatc 48480 aggtaagccg gggcctgatt aattctggct cactgggggt agcaccataa aaagaaagaa 48540 aatctggatc aggcaaatgt gaatatttcc cacaataaaa acatctccat atggaagagt 48600 gtacagcctt acacacgctt ggatgccaga ctacccaaga cccgcaagca aagtcctaac 48660 tattgtattt ctcttttttt atttacctac atctcctcat ttcctccccc tcctcgcccc 48720 tgataaccac catttattct ttctttgtat gtattttgct tttttaatat ttcacccaca 48780 agtgagacca tgtagtgttt ttctttctat gtctggctta tttcactttt ggatttatct 48840 tgaaatgcat tgaataagct ggtgtcagga attctggggt tttgtcacta atggcctgta 48900 aaattgagca cattacgcag acaaaggaga aattggcggg agtggggttg gagagctcaa 48960 aaaaggaatg ccggaaagag caaggtcagg ggagagacca gacagagggt gtggaatcga 49020 agacacatga gggtggaccc ttggcctcct ctggaacaga agagaaagga ctatgtgatg 49080 acacaggcag ctgaggacgg agaggtggag ggacattgag ggttttggtt tccacagcag 49140 cagcatcctt taagcactaa ccttctgcca ggggcttccc taagaacatg gcatacatga 49200 tctccttgga tcttgcagcc agctggtaca atagacatgg cagatccatg atactgatga 49260 gtgaactgaa acttggtaaa gacatctgcc atagtcttca acttgtaact gacagaggca 49320 agatttgaat ccatctctgt ctagccccaa agcctgtgcc cttcacctgc ccactggggt 49380 cactgctgcc caaggtcaga tggcctcagc ctcctcacag aggtccaata ccaagtgtct 49440 tcactggggc ttgggagtgg gtaggattgg aggagggaag tccacacagc acagcttgta 49500 atggtaagtt gtttaaaaat aaataaagcc ccaagagagg cacatttgtt tatgctgaga 49560 tgagtcctgg gggcaggaac atgcctagtg aaaatgagct tgtcaacaag caggaaatca 49620 gataagccag gtgggtagaa ccgaccaccc agggagagag tgggctggca ggagacagcc 49680 atctgggaag taggccctca gctgtgagat gccagccaca ggcagtgagc acagagccac 49740 atctgccagc tgcgtctgga ctttcttggg ccacactctg cagctcaggg ttggaagcag 49800 agaaggaaga cagtgattgt ccagggagac gttaccgagg gcctggtgca ctctaaaatc 49860 caagtgctct acaatgccac atactgagtc gtgtgtgtgt gtctgtgtgt gtctgtgtgt 49920 gtgtctgtgt ctgtgtctgt gtgtgtatgt gtgtgtgtgt ctgtgtgtct gtgtgtgtgt 49980 gtctgtgtgt gtgtctgtat gtgtgtctgt gtgtcaaagt gtgtgtctct gtgtatgtct 50040 gtgtatgtgt gtgtgtatcc gtgtgtgtct gtgtgtgggt gtctgtgtct gtgtgggtgt 50100 ctgtgtgtgt gtgtgtgtgt ctgtgtgtct gtgtgtaaga gagacagggg gaattggggt 50160 ttcgctctgt cacctaagct ggagtgcagt ggtgtgatca tgactcaatg cagcctcaaa 50220 ctcctggata cgagccatcc tcccacctcg gcctccaaaa gtgtggggat tacaggcagg 50280 agtcaccaca ccttgccagg atgtgtgttt aatgcagctg accttaaatt cctggctgga 50340 taggaaagtc cagaaggaga gccagagaaa aacaaagggg aaaagtaggt cttatgagag 50400 caaggaaaga cagagatttt tggagaggat ggagagtggt atttgagtcc aaccagaagg 50460 ataaccaggc cccaccttca gcctccttcc ccctcacaca tagccgctcc cataatcata 50520 tcagatggcc gctcccacag tgtctgccaa ccctgcatct gcagctccag acccaaaccc 50580 aacaccctcc tctgaggtgt cccagaggct cctgaaattc agcctattct tacctagact 50640 atcctcccta actaaaaaca aactgttcct cctctcagaa ccaactgcat cattttcagg 50700 acccaataca aaatgaaaat gcagtgtcct agctcaaaag tcaggatgaa agtgctgtta 50760 aaggtactac gatacagagc tttttctttc ttccatagtt tctctctcaa cctatcacag 50820 tgtattttat ttgctattta acaatgcatt cccttgggca ctgggacact agcagtgttt 50880 gtccagaccc tcaaaggtgc tcggaggcct ctccacagct cagcatgtgc acatggtgca 50940 ccagatgcag ggtgcctctc ccacaagtca ccagatccag gcaccatgcc ccagagtggg 51000 taaaatgtca agccagacat gtctccttcc catgggccac caccccagcc taaagcagat 51060 gggagaccct ccagggtatt gccaggatgc attaggtatc tagatcagtg gtgggggaga 51120 tttgccccca ccaaatcacc aaatgtgcca tgatgctgcc agcccaaatc gagaaaggct 51180 gccacctgcc ctaccccaag atgccccacc ccaacctacc catgcctgat ccaggccccc 51240 accagagata aagcgtggca ataatcactg ggctgaggtt gagtgggggc cccacaacac 51300 tcaaatgcag gatgaccaag aactatttcc cctagaggga gggagggagg tggcaggaga 51360 tggcatcacc tatgagctga ggctccatgg tcccatcaga cttcacctac aaaacacaaa 51420 tttaaagaca aaattattaa gaaattcaag atggccacca cagagcatta acccccccaa 51480 tctagaaggg tccttctcac acccctgcca gttccgggtc cttctgagca cggggccctg 51540 tgcaaccacc ctggttgtac accaatgaaa ctgacactct tctaacatct gctttctcag 51600 taagtgacag cagcttctac tctgaatcca accttgaggc ctccctggac tccactgagt 51660 gccacagggt ctgctccaaa catctcccga agtcgctgct ctgacagttg ctcatcctcc 51720 cccacagacc ccttgcctcc tgcctttccc cacccagcca cctccacttc ctcagggatc 51780 ttgcattttt ttttttttta tttgagatgg agtcctgctc tgtcacccag gctggagtgc 51840 agtggcatga tctcagctca ctacaacctc cctcagggat ctttctaaag caatgcagag 51900 accacatcac tgctggcctc aggaactttc atgggctcat cactgccaca gggaaatagg 51960 tctaaacatc ttataatgac atccaagacc tactactatc ttgatatttt ctagctttac 52020 cctcagactt tctctctacc tgggtagcct ttacgctaac atcactatac taaccgcgtg 52080 ctgtgatttc acatctgccc taccagccac gcgtgcccct gcctccgccc tcacctcagt 52140 gaatcactca tttaagtttc actgtttcca ccctgccttg gcaatcactc attcctctgc 52200 cttcagctgc acctgacttg cccctctttt gaggctattt cattcattta tttactacat 52260 acttactaag cacctactgt atgctagacc ctgctctggg cactgggata tgactagtaa 52320 tgaaaatggg gctgggcacg gtggctcacg cctgtaatcc taacactttg ggaggctgag 52380 gtgggtggat cacctgaggt caggggttga agaccagcct ggccaacatg gcgaaacccc 52440 gtctctacta aaaatacaaa aattagccgg gcatggtggc aggcacctgt aatcccagct 52500 actcaggagg ctgaggcagg agaatcactt gaacccaggg cgcggaggtt gcagtgagcc 52560 gagatcacgc cactgcactc cagcctgggt gaaagagcga gactccatct cacagaaaga 52620 aggaagggag ggagggaagg agggagggag gaaggagaga gaaagaaagg agggagggag 52680 ggaggaagga aggagaaaga ggaaaggaag gaaggaagga aggagaggga gggagggagg 52740 gagggaaatg ggcaaaattc cttcctcaag gagcttacat tctagttggg agggaggaca 52800 agaagccaaa taagtaagta aaatatattg tgagtgacat gagctataga gaaaagtaaa 52860 gcagaaagga gcagggagtc tggggatggg tgtgcaattt taagtggaga gacgaggcaa 52920 gatgtcacca aagcgacttc cttctgagca aagacctgac gcaagtggag gagcacccca 52980 tgaggtgtca gcaccagaag attgttccag gcagaagaag cagcagaagc agcaggtgca 53040 gagaccttag gtggaactgt gcctggggtg aggaaggaag agagagaagg gcagtgacta 53100 tggacctaaa agagattaga gatgaggaca gagatgatga ggatccaagt ccccagagcc 53160 ctgtagacca ggaaaaggac cgtgaattgt attctgagta atgtgggaaa acattggagg 53220 gttttaatca aaaaggatca aaaaactaac tattatgtgt ttctatgtct atccccaagt 53280 accatgtgca tttctggggc ctgggagggt tttctagacc tctgcctggc ccacaggcct 53340 ggtgcatagt aggtgattaa tggttgctta tgggtgtctg ttgaatcaac aggtgagtaa 53400 cctgaagacc atgggtgtgt gtcttccaca ggttggctgg ctgtggcctc acagctgagg 53460 actgcaagga ccttgccttt gggctgagag ccaaccagac cctgaccgag ctggacctga 53520 gcttcaatgt gctcacggat gctggagcca aacacctttg ccagagactg agacagccga 53580 gctgcaagct acagcgactg cagtaattgt tcttggtggg cagggggagg tggagggaac 53640 tgcctgccac catgaggcag gcagagggta agagcagaga gaaggtgaga cctgctttgt 53700 gggagtggga ggagcctaca ggatctcagc caggcagtag aggctcagga ggaagctcat 53760 ccacccttgg acagagggag ggggcctcca ggctccaaat acagcaagag aagccaagtc 53820 gagtcaaggg acgtgaatgc aaggtgggga cagggggcag aatttgtggc cattttcaca 53880 gactaccaca gaggatgatg ataatcgtgc ccacttcata gagctatgag gattgcagca 53940 gtgtttttct attcttctcc tgttcccacg agctgtgatc tgcctagtaa caggtcctct 54000 tacttggcat ggggttctgc catggagagc tgaaatcact ggtgtccact tagttcgatt 54060 tactatctca tccgagggga cagagccatc ccccccgcca ccatgtagaa acaagtgagg 54120 cgccctggct catgcctgtc atcccagcac tttgggaggt cgaggcaggc agatcacttg 54180 aggtcaggag ttccagacca gcctagccaa catggtgaaa ccttgtctct actaaaacta 54240 cacaaagtag ccaggcgtga tggtgggcac ctgtaatccc agctactcag ggagcctgag 54300 gcaggagaat tgcttgaaca cagaaggtag gcattgcagt gagctgaaat cacctcatta 54360 cactccagcc tgggcaacag agcgagactc cttcttaaaa aagaaaagaa aagaaacaag 54420 agtgcatagt tacctctccc tggtcctgct gggcctcctc ccttgatgca caaagttata 54480 cacacccctc tgtgcacccg ctcagggggc tgccttcagg gtcttctttt gtttctgccc 54540 aagccctgaa atatccttgt ttacaccttg cgagatcttt gttcacacat cacacgtccc 54600 ttccttcttc ccaaggccaa acctctgccc ttcccaaggc tattgcctgc ttccccattc 54660 agtaatcctg atcaagagtt gatgggaagg gggcatggga aggggagaca acgaggatcc 54720 tctcctagta tcaggggtcg gcgatccctg gtcggctagg aagtcggtaa cacttgtcag 54780 atttaatcct cgctacatct acagaagaca agattggatc tcggagagat tcattatttt 54840 cccaaggcca catagccagg aagtagctgc accaggattt gaagccaggt ctgacctcaa 54900 agcctggttt tccattctcc ctgttccctc tcttagcgaa ggcccctgga atgctttgtg 54960 acatttacaa gcttggactg cagctgggtc tgagcagacc cagctgggag agccaacagt 55020 tctagaagca gggctgagac cccaagagca gaggatcccc agacaggaga acccccagag 55080 agagcctaca tatatacaac atacagcagg tacatagccc ttttctggca taaatggtag 55140 cacagcacaa actctgccct gcactttgct ttatcagtta aaatatatct tggagatggt 55200 cccacatccg tacgtacaaa gtcacatcat tctttttaat cacagcatgg aatttcacta 55260 catggactta ccatcatgtt tttatccatc gcctattatg agcagccaga ttgtctccag 55320 tgtttgctgt aaacagggca gcagttaaca tcccccagtg cgtcctgaga cccagcatat 55380 tatctgtagc ttaacttccg aaaaatggaa ctgctgggaa aacgatatgt tgctaatttt 55440 gccctctaaa aaggccctac cggccgggca cggtggctca cccctgtaat cccagtactt 55500 tgggaggcct gggacacaga gtgagaactc gtctcaaaag aaaaaagaaa aagagaaaac 55560 agatagaccc aggtgtacaa gtgggacatt gagtaagtta ctcatcaatg tggacagagt 55620 gaggggactg gatcagtgtg aatcacacag gattgagaaa gggtaggtgc tggcatcctg 55680 aggggttccg tggctgcaag aggaaagtga aaatctctca ggttaccgca agccaggaaa 55740 gtttgtatct tctctgcatg tcatcccaga ccctcaaaag gccacagatg ctgagaagga 55800 agatcagacc ctagttaccc tgaagagcga gcccttagga caatgacctg ggctggcgtg 55860 tatctttcct gcaggctggt cagctgtggc ctcacgtctg actgctgcca ggacctggcc 55920 tctgtgctta gtgccagccc cagcctgaag gagctagacc tgcagcagaa caacctggat 55980 gacgttggcg tgcgactgct ctgtgagggg ctcaggcatc ctgcctgcaa actcatacgc 56040 ctggggtaag gccctgtggg tcccttctgg gcagagggaa gagggttggg ggggtatcgg 56100 acaggggatg gaaaaaggac tgataaatgc atagccaacc caagtttctt accagctcta 56160 ccacttacac tgtgtgactt ggacaagtcg ctgaattctc tgggactctg tttcctcatc 56220 tgtggaatgg gagtatcata cccacttcat ggattgtaat gaaataaaat gtatataaaa 56280 acaaggacag gcagggcgtg gtggctcacg cctgtaatcc cagcactttg ggaggccgag 56340 gcgggtggat cacttgaggt caggagttag agaccagcct ggccaacatg ctgaaacccc 56400 atctctacta aaaatataaa aactagctgg gcatggtggt gtgcacctgt aatcccagct 56460 acttgcaggg agaatcactt gagcctggga ggcagaggtt gcagtgagct tagatcacag 56520 cactgcactc catcctgggc aacagagtga gactcagtct caaaaattaa ttaattaatt 56580 aaacaaataa aataaaaaca aggacagtgc tgggaagaga gtcgggcatg atcaaatgat 56640 aactattact gaggctgcat gctctagggg acgctcaggt tcccctcact ccagggcagc 56700 gcttcattca ttcatttact cattcattca ttcactgagt gcctcccgtg tgtaagcacc 56760 agaccaggcc ttggggtaca atagtgacca aacagacatg ggccctgccc tactgaagct 56820 tctactctag tggggaagag acggacactg gtcacctgta aatgcacaag tctatattta 56880 tgagctctgt ggagtgaaag gaaagtacag ggtattatga gaatgtaata ttgagggagg 56940 atgggatttg atcaaagggc cagggaggac ttccctgaga cagtgacgtc aggctgaggt 57000 ctgaaggagg aataggtacc caggccaagt tggggagagt attctaggcc tagggatggt 57060 gtggggaaag gcccagaaca aagaggcact tgagccccgg agaactgaag gaccctgagt 57120 ggagtgagcc cccagccctc atttcccatc ccgtaggccc cagagattct tgcccttccc 57180 cgaggagcca cccccacact atctctgggc cacttctgct cccctccacc tccctgggag 57240 caagcccatg agtggcttct gtcttgacac ttacccacgg ctcctgccag gggccctgcc 57300 acatctcagg gagcagacag gggctccaaa cccccacatc cagagccttc tcagaggggg 57360 tccgggaagc ctgggcctcg ggctttcctc agccaggcct cccaaaccta aagaagtggt 57420 caaggcagaa taacagaaat aaatagtcta catttgggtt ctcctgtagc tgacactggc 57480 tatctacaac catgggaact ccatcccaag ggggcagcac ctttgcaccc cagcctgctg 57540 ctccccactg agtatttttg gctatcacta cctcccttat gctgaagagc ctttccccag 57600 agcttggcca tggcaggcag atatggcttc aggccccagc tcaacactta cctgtcaaac 57660 ctgggctgct tctgaggggg tttgctctct agagtcccga gtaggtttcc acctccttcc 57720 ctcctggagc cgggacttag ttgagaagaa taaccttcta cttcaaggtc tgcgctcctc 57780 cccaggggca gaacagctct aacagcagta ggggaatgaa tgcagagccc ttcgcagaag 57840 acagggtctt ttcaccgcct ccttgtctgg gcctcagaac aggcagggca ggattgattc 57900 atcctacgtg gtagatggag gacacagaga cccagagaag acaatcagca tctcacatct 57960 ggcagaaacc ccagaccaca aggcctggga ggatggaacc aggactctct cttcccagca 58020 aaacttccga ccactgggaa gaaaatccca ggacccaggc ctgaggtggt cattttgagg 58080 ccatggggaa atccctggca gccggctgta gagacagcct ctgccagtga gagccacctc 58140 acccaaggcc acatctcctt cctggggagg cccacatccg atgactggcc aatgttgggg 58200 tagaaaggcc cagcccctca acccaactcc tttaaacagc tctaaaggat catctcccag 58260 agttggctga ggcctccatt ggagactgcc tcacagccca cgtctccccg attccctccc 58320 ttcccttgcc ctggcattga cccaagaaca caccctaatg aatctcctca ctgtaccctc 58380 ctcctcagag cctggcttcc agggaaccca ccctgcggcc gctgcctcag aaggtgacag 58440 gtcccaggat tctgggccct ggggacctgg ggaccacatg ggggatccca ggaggacccg 58500 gctccctttg tgctttcagg ctggaccaga caactctgag tgatgagatg aggcaggaac 58560 tgagggccct ggagcaggag aaacctcagc tgctcatctt cagcagacgg taagggggga 58620 cccagaactc cctctggctg acccaggccc agggctgagc aggagacagg gaaagagcca 58680 tggggtgaga gacaaacaca gccagggaga aacccagaca ccacagcagc cagagtgagg 58740 tgcacttgtc agtgcaagta gaatcaaatc cccccttgct tcaactctta acgttagaca 58800 aacaaggaaa ggaaaacatg aaaaagtaag tgggctaaaa aattagccag gcgtggtggc 58860 gggtgcctgt ggtcccagct actcaagagg ctgaggcatg agaataactt gaacccagga 58920 ggtggaggct gcagcgagcc aagatcgcgc tactgcactc cagtctgggt ggcagagcaa 58980 gactgcatct caaaaaaaaa aaaaaaaaaa aaaagccggg catggtggct cacgcctgta 59040 atcccagcac tttgggaggc tgaggcaggt ggatcatgag gtcaggagtt caagaccagc 59100 ctgaccaaca tggtgaaacc ctgtctctac taaaaataca aaaattagct gggtgtggtg 59160 acgcacgcct gtagtctcag ctacttggga gtctgaggca ggagaatcac atgaacccag 59220 gagacagagg ttgcagtgaa ccaggatcac gccattgcac tccagcctgg gtgacagagc 59280 aagacaacat ctcaaaaaga aaaaaaaaag aacaagaaaa agtcagtggg tgggtgagcc 59340 agcgaatccc ctggacccag cacagcatga taccaaatgg cctctagtcc aggaatcctt 59400 ccatgaagaa cagagaaggt ttggaacaat gaaagcgatg atcacaagag ggagataaag 59460 acagagagag agagcgcctg tgccctggga aaaagaccat cctcgccatc accgtcaccc 59520 accaccctca ccaccaccac cacacacacc cctgacttga ggggcaagcc ccggggccag 59580 gaaagtccat cttcccttcc ccgaggaatc tgctgcagct cccacattct cccagttggg 59640 tgcatttcca cctgaccatg aagctcatct gtccccaggt tcctggctgc tgtggctcac 59700 ctgaaatgga aggaaaacag ctcctaggct taagaggagc catttattca ttgaacaaac 59760 attccctgca gacccactat gtgtcgggca ctgtttgggg atctaaggac acagcagtga 59820 acaagaaaga cacaacccaa cctccaggag cttagtccag cagggagaca ggcacaggtt 59880 atcccagaag tgtctgcgca aatgcacctg gggtcagtgt gagtgcacct atgggcctat 59940 ttgacctggt caagaggatc tgggctgagg tctgaaagac aggacaggtt ggctgggccc 60000 aagaaggggt ggagtccagg cagaagtaga ggctctgaga gagaaaagag tttagcctcc 60060 tggaggaatg ggaggccagg cgtcaggagc agctactcaa actgtggtcc gtggaccagc 60120 agcatgagca tcaccctcac tgggagctcg atagaactgc aaattctcag gccccactcc 60180 agacccactg aatcagaatc tctggggtgg ggcccaggaa tctgtgtctt tttttttttt 60240 gagacagagt ctcgctctgt cacccaggct ggagtgcaat ggcatgatct cggctcactg 60300 taacctctgc ctcccaggtt caaaccattc ttatgcctca gcctccagag tagctgggac 60360 tacaggcgca tgccaccaca cccagctaat ttttgtattt ttagtagaga cggggtttta 60420 ccatgttggc caggctggtc tcgaactcct gaccttgtga tctgcccgcc tcagcctccc 60480 agagtgctgg gattccaggc gtgagccacc gcgcccggct gaatctgtgt cttaacaagc 60540 cttccagatg attgttaagc gcacagtttg agaaccgatg gactagaggt cagtgagcca 60600 ggaaaaagtg gcccaggtga cattggaggg tgacagaggc cctggcaagc ccttttggac 60660 tatgttaaaa actttccatt ttcccccaga agcaatgaag cgacattaca gggttgcagc 60720 agccaagtga tgggtcagtt tcctatttca agaatgggct gctgtgtagg aaaataggcg 60780 gcagagggga cccgagcaga agcagggaga cctttcagag gatgctgcag tggtgcgggc 60840 cacagaggcc agggcctggg actcaggtgg aggtcatgga ggtggagaga agacaataga 60900 aagaagagct atttgagagg tgttgtcaag gtgatcagct cttagggtgg gcgaggaagg 60960 agttgggatg acaccccagc tctagcctgg atgcatgggt ggatggaggc tgctcacaga 61020 gacagggacc acagaagaac cacaggattg agttcaaatt cggatggagt ttttttgttt 61080 gttttgtttt gtttgagaca gggtctggct ctgtcaccca gactggagtg cagtggcaca 61140 atctcagctc actgcaacct ctgcctcctg ggctcaagtg atcctcccac ttcagccttc 61200 caagtagctg ggatacaggt gcatgccacc atgcccagct aatttttgta tttttggcag 61260 agatgaggtg gggaggtctc actatgttac ccaagctggt ctcgaactcc tgagcttaag 61320 ctatccaccc atcccggcct cccaaagtgc gaggattaca gacatgagcc actgcacccg 61380 acccgaggag gagttgaatt tttagtggga atgtcatgtg gacagccctc gcttcctcca 61440 agatcttagg cagaatgaac aattctttgt ccaggaaacc aagtgtgatg acccctactg 61500 agggcctgga tacgggagag atgagtaata gcacatcctc actcaagcgg cagagactcg 61560 gatcaggtag tttgaagggg caaggctggg agcatcgtga caggggttgg gggaggtcag 61620 ctgtgggagg tcattccctc catccctctg cccctgatgt gggacggggc aggtttccta 61680 cccaagaggg tgctaccccc acgagagagc tcccactcct aggaactggc tggatttccg 61740 ttaccctctc tccgagaccc accggctggt gcttcccctc tccagaacgt ctatggcatg 61800 tatcaaaccc tgaaccgggg tccggcctct tccattcttg gctgctaagg cctgggaatc 61860 acatctccat ctttgccctg ccatctaaaa aaaaaaaaaa aaaaaaaaaa cagagtctca 61920 ctctagtgtt caggctggag tgcagtggca tgttcatggc tcactgcagc ctaaaccttc 61980 tgagctcaaa caatcctccc acctccgcct cctgagtaga actgggactg cagatgctca 62040 ccaccatgcc tggctgatat ttttcttttt tctgtagaga gggcggcttc ccatgttgct 62100 caggctaatc tcaaactcct ggacgtgagc aagatcttcc caattgctga gattgcaggc 62160 aagagccacg gtgcctggcc cctttcatct tttgaaccat gttctctctc aacctgccca 62220 cctgggcatg ctggggagag tggcacttct attccagtgg ggaaggggag atgggcggag 62280 gctgggctgg aggcactgga gtggggctcc ccagacagca ggcaggcagc catggagcag 62340 gagcagaagc tgcagacaca ctccctgcca tggtccaggc aggccagcct tgggaagggc 62400 cccttacagt gcagccatag gggctagggg aggccaggcg ggagaatctg gagctgaccg 62460 ctgccttgct gtttcagagg aaagctcccc agaggtagta ccggtggaac tcttgtgcat 62520 gccttctcct gcctctcaag gggacctgca tacgaagcct ttggggactg acgatgactt 62580 ctggggcccc acggggcctg tggctactga ggtagttgac aaagaaaaga acttgtaccg 62640 gtgagtgagg gggctctgct ggtcccaggc tggcctcccg caccccactg cccatcctgg 62700 gtccccacag acagtcagcg ccacccccta cccccaactc acactcacca ctgggtcccc 62760 tcctcttgcc aagacaacag gcagagacag ggctgagaca ctgggtgtct gcttttcata 62820 tggaaaaaga tgttttctta tgaaacttaa aattttcaaa catatacaaa aatagtacaa 62880 ggagcttcca tgtacccatc acccagcttc agccactgtt ttgccaaatt cgtttcagct 62940 gttcactttc cccctctttg ttattgttac agtattttaa atccaagaca tcgggtgatt 63000 ttacccttaa atatttcatt acgcatctca aaaaacaaaa ataaaaacct tttcttccat 63060 aacataacta caatgccaat atcacatctg acaaagctgg cagtaagtct ttcatttcat 63120 ctactcccca atccatattc aaattgctca attgtctaaa acatgttctt atagttggtg 63180 ttgtggtttg gttgggtttg gtttgatttg gttttgagac agagtcttgc tctgtcactc 63240 aggctggagt gcaatggcac aatcttggct tacctcaacc tccacctccc aggttcaagc 63300 gattctccag cctcagcctc tctagtagct gggactacag tcacatgcca ccacgtgtga 63360 aaaatgaaaa ttgtattttt agtagagatg ggggtttcac catattggac aggctggtct 63420 cgaactcctg acctcagacg atccaccttc ctcagcctcc caaagtgctg ggattacagg 63480 cgtgagccac cgcgcctggc cagttggtgt gttcttatca ggatccaaac aaggtccact 63540 cattttattt tctttgttat atttcttaaa tctcacttaa ctctcttaat ctgtatgctc 63600 ctgttttatt aacagccact aatgagctaa ggactgtact agtagctggg gaccctaaga 63660 tgaatatggc agccctgccc tcccagggct tacagtcgca ctagtagtaa ttctctatag 63720 tcatactcat gatctcttat ataacatgtt accactcacc aaacactgcc acagacatca 63780 tctcaaccag gagacagcgt cactcccagc tcacacctac agaagctggg gcatcccggg 63840 acccactatt aacccacctg aggagccttg ggaaaatcac ttctgatctc tataaaagtg 63900 ttgtttccaa ttctaagggt ttagcattaa tattagcaca tacggccagg catagtggct 63960 catgcctata atcccagcac tttaagagac cgaggcagga ggatcacttg agtccaggag 64020 ttcaaggcca gcctggacaa catggcaaaa ccttgtgtct acaaaaatta ccaaaaaaat 64080 tagctggacg tggtgacatg tacctgtagt cctagctact cagggagctg aggtggaaga 64140 attgcctaag ccagggattt cgaggctgca gtgaaccaag atcgggccac tgcgctccag 64200 cctgggcaac agagtgagac catatttcca tagattagat agatagatag atagatagat 64260 agatagatag atagatagat agattagata gattagatag atagatagac agacaagaca 64320 agacagataa tcttagcaca tatattgtta ttaattattg ctaaataggc cccttacata 64380 tacgaacact tgttatttca tcattaccat cattgccatc atcaacatag tagaaacagt 64440 tcacagctcc caggcgtgta ctaagctgtg agcatgcatt aagtcactta ttctcgtaac 64500 taggagatgg gagctataat tactcctact tacagatgag gaaaccaaaa acaggaaggt 64560 taaatgactt gctcagggat ccacagtcaa gagcagaaga gccaggacta aaatgcaagt 64620 ccatctgatt ccgaagtctg ggcttgaagc cactgtaccg tatccgcaag cccctgcatg 64680 gggtctggca tccagcaggt gctcagtgag ttcgtgcagt aagtgtctgc ctctccctct 64740 tcttcccaga gttcacttcc ctgtagctgg ctcctaccgc tggcccaaca cgggtctctg 64800 ctttgtgatg agagaagcgg tgaccgttga gattgaattc tgtgtgtggg accagttcct 64860 gggtgagatc aacccacagc acagctggat ggtggcaggg cctctgctgg acatcaaggc 64920 tgagcctgga gctgtggaag ctgtgcacct ccctcacttt gtggctctcc aaggtaaaca 64980 acagggaagg atagggagga aggtggtggt aattatgggc ctgaaaggga gtgtagaatg 65040 ataatctcct gggagagggt gttgggggtt ccttagaggg gccgaggggg ggttatggag 65100 atggggacta agattttcct gcattccact cttctctgga cttatctttt tgaaacacat 65160 ctcccagcct cataattata tctgcctaaa cttacttgaa agtttttatt aagtgtcatt 65220 tatatagact gtatcactca aggtcccagc aggaaagaga tgatatgctc agttgagatt 65280 tttgaagaag cttttcaaag acaggaccat ttacaaagta aggctaagtg aactaacgaa 65340 gaatgttggg gcaacaggga ctagcaacaa taggaagcca tcactactcc tacaggcagg 65400 gagaagaaac agtgtcatca gacgccatct gagagctgga gccatgaaac aggggcttcc 65460 catagaagat agtaatcaca aagtatgatt ttatgtcacc tccaaagact tcccacagag 65520 ttccaagaaa tatcagctaa cagtcaaaaa tctcacaaag gggaaacaag gcaccagaag 65580 taagagccag cagaaacgac agataccaga atcagtccct taaatccttc agatgctgaa 65640 attatcagaa aaagaacata aaataagtgt gtataatatg accaaaaaaa aaaaagtaac 65700 cttaaaacta tcagcaaaaa gcaatgaacc attaaaaatg acaaatcaaa gttggaaact 65760 aaattgacta aaaaaaatga aaaaacaagg ctgggcgcag tgctcacgcc tgtgatccca 65820 ggactttggg aggccgagga gggcggatca cgaggtcagg agatcgcgac catcctggct 65880 aacacagtga aaccccgtct ctactaaaaa tacaaaaatt agccgggcgt ggcggcaggc 65940 gcctgtagtc ccagctactc ggaaggctga ggcaggagaa tggcgtgaac ccaggaggtg 66000 gagcttgcag tgagccaaga tcgcaccact gcactccagc ctgggccaca gagcgagact 66060 ctgtctcaaa aaaagggaaa aaagaataaa caatcaagta acttgtagga taccatgata 66120 agtcctaata tatatctaaa tggaattcca gaaagataag atagagagaa gggaaaaaaa 66180 tagaatattc aacaagacaa ttgcaaacaa atttctggaa gtattgagac atatgagact 66240 tctcagaatg agaatcccca aaactgcaat gcaggaaaga gagagagaag ggagagaaaa 66300 aatcaagatt agtctcatta atgtgaaact acagaataag aaaatatgag ggaaagcctt 66360 gacatcagcc aaaaagaaag ggttataggt gtgagccact gtgcccagcc aatgtctgtc 66420 aatttagtga gcaagttaga agaatctgat gtgtccccat gtcaaatcat gactccaata 66480 gaagcttcat ttaattgctg ataaaatgat ttgataaaca atttatgtta tagcttacat 66540 cagttataag cacttttatg atttgtacag tttagctaga tattaaatgg gtttttaata 66600 ttttttcttg tcttaatttc tttatttccc aggaatagag gtaaatgaga atttattttt 66660 aatttctcat ttgttagttg atatgttgta gtatcatccc agtagttcaa tgtattttta 66720 gtatcacctc aaaataaagg tattaacttt acagtatgca tatattaagg taatcaaggc 66780 atatatataa gaaaaattat ttagaagcta gtagaaagag aaaaatcaaa tgagaaaaat 66840 aaaaagcagg gaactgctca gtacccagaa ccaggaaagt cttgtttgtt gaggccacaa 66900 aagttcagta aacaattcaa gctgcacacc aggaggagag catggggcag ggaaatagag 66960 tcccgggggc ctcaggatcc ctggggcacc tggttgaaca gtcagggtca tatgggcatt 67020 ccctagggaa ggtcccagac taaaacagaa tagagggaag aggtgcaaga cagctctgga 67080 attgtagagg tccatttaga aaggaagaga ttgaggagag gaaactgatg gctgaaacaa 67140 gaagctggga caagttactt gttcccaccg ggcaatctat agagtaatga acacagccac 67200 taccacaacc agcctgcctc ttccacaagt gagaccgtgg ccagggatcc ccaggtactc 67260 tatcaagaag agaggggcct ggctaataga gtcagtgggc tcagctctgt cgtagctctg 67320 cccatgacca gcagcaaaag aacttggctt gccccaacct gggatgctca gggctgcacc 67380 accactccct gctctaacca gggtattcca ggagaaacct catgcagcta aggtctggag 67440 ccagaatgta ccaccagggc ccgaggggac aaagtctgcc ttcaggtcct gctccagcgc 67500 cccctcattc cccactctta gaggtggatg agctggccct ccccattgtg aggactccca 67560 ggccatgccc tgctcagtct cctgcgtgca tcaagtgggc ctgggagacg ggatttgtcc 67620 aaggaattgg gtgaccctcc gagtgcacag gcatctagct gaagcctgcc aggctgacag 67680 ccaaatcaga ataggaatga attcactgtg cttgtggcac tgtcctttgt ggcactggaa 67740 cttggccaat aagaactgtg aacaataact gagcatcctg tgcctggaac catgcaaggt 67800 agttccgtat atattaatat catctctttt ttcataatga accgggaaat taggcactac 67860 agttttccct tggtgtacca gtgaggaaac tgaagccgag agaggtgaag accctcttcc 67920 agagttggga ttctatccca agtcctgttt atcccaggct aagagctaca cccctacccc 67980 tccccagtcc tgcccaagcc catgtcccac tagccttctc tagaccaaat tctgagcctc 68040 accttgtggc cagataggat ttactccctt tggactccgt acacaagctt gtacaacctg 68100 cggcctgtgg gccacatgcg gcccaggatg gctttcgatg aagcccaacg tgaattcgta 68160 aactttctta aaacattatg agactttctt gcgaattttt taaagctcac cagctatcat 68220 tagtgttagt atattttaca tgtgtggccc aagacaattc ttcttcttcc aatgtgaccc 68280 agggaagcca aaagattgga cacccctgcc acagaacaag aggcaaaatg cacagaggct 68340 gcttttagct agctgtgacc agccaggaaa ccaatccctg gggccgggca gggccccttc 68400 cagaaatgct tgctgcctgc tgtctgggct gcagctgttc attgccctgg aaacgatgac 68460 ctcactgcct gacacagggt ctgatttagg acttgggcac tggtgcttca tcactgtcgt 68520 cgctgaacgg gtgatgagtc attaactttc catggtgagt tttctcccac tatagctaga 68580 gcttgaggct ttcagcctct cagcttcatc accgtggcaa tgggcattct caccctcctg 68640 caggaccagc tccagcagtc atagggccca gcacaaaatg aaagcgcgga gctcttcatt 68700 caaaaatgat taagaatgtc aagaccacca aacatgaaac caagcaccaa gccctgctga 68760 gctcaggacc ctgtgtgact gcacaggcca cacatccatg aagctggcct tgtccttcgg 68820 gtccttcgtt cccccaccca cccatctcct tcagcacaca cagacctctg ctctccacac 68880 aaccctccct ccaagaggaa gtgagttccc tcatttttgt cacacttgct cctctctgtg 68940 ctgcctgtgg gaacaagaac acttcacaca gacacctgag tggggtctcg ccccattccc 69000 actccctccc cctgcccacc tactcaggag ttcaaccaga tgagatggct ggaaaccaca 69060 gtggtatccc tcaggcctca aaactgttgt cttcctggaa ccaacactta ctgaatcgta 69120 taacaatata gagtcccaaa actcagatta agcaaaatgg atattcaaaa taccttagtg 69180 gctgggtgca gtggctcaca cctgtaattc caacactttg ggaggcagaa acgggaggat 69240 gacttgagcc caggagtctg agacaagcct gggcaacata gtgagacccc atctccacaa 69300 aaaaatattg aagaattagc caggcatggt ggtgcgcacc tgtagttcca actactcagg 69360 aggctgaggt gagaggagca cttgagcctg gaaggtcaag gctacagtga gtcatgatcg 69420 tgcactgcac tccagcctgg gtgacagagc aagactctgt ctcaaaaaaa aaaaaaaaat 69480 ttagtagctt tttcttccca gaccccaaga gttaccaaga ggactgggcc gagttgtttg 69540 aggaagaaac agaaggaacc accacagcca ggtggataga ttggtcaagc tgcattttat 69600 ctgtaatacg agcaagtggt ccttattatc cccagggctt tatagacaca aacccagtca 69660 atctgcacag tgacaccggg agggggatgc tgttatcagc cccactttac agatgatgca 69720 cagagagggc aagtaatctg ctcagagtcc cacagttggt aaaaaccctt gctgggagtc 69780 aaagccaggc agtccctatc atttagctgc tctgctagac aaccctctag tgccctacct 69840 tcatgtgatg tgggtcccac tgtccctgct tgagtaggag agccactgtg cccctcaaca 69900 gctcacctga gcctggaagg tgctgaccac acacactcag caggaagggc acggccaggc 69960 atcaggccac aggcccaagg gcagcatcta aaggagggag agtccccagg ctcctggaca 70020 gcaggagcag caatgtccca aggcaaacac cgggcatgcg gctgggaaag gaggaaggag 70080 ctgacaggct tccattggcg gagcacttgt catgctttga gccattgttc tatttgctgt 70140 ctgtattccc cattgggcca caagctctgg aattgcagtg cccatgtctt tcctgttcac 70200 agctacatcc acaggggctg gcacattgtc cgtactcagt aagcttttgt gtaataagca 70260 gatgcaaagc cctggacccc agacccacag aaacatgcct acctcatgat agctttttct 70320 aaatttttct ggtgcaacca gccaaaggca gcgtaggtct ttctgccagt gagacttcct 70380 ccagtcaaca ctcaaccttc ttctccttca gaccacagtc ttccccattg cttcccttct 70440 ctctccaatc tatcagcaca aatatactgt caggtttgga tgaggtgact ctaccacaat 70500 ctatggaaag ggcagtatgt gcaagaaggg gggtgaccag ctcatcttaa cactgacagt 70560 cctgcacccc caggagccct ctcggtcctg ggcaagcatg gacagttggt caccctagca 70620 agaaggtaag ttggagacac ccagcacctt ctggaattca gcaaaaattc aagtggcacc 70680 gagcgtctgt tagtgctcca gactgtggtc agaatcctgg gatctggtcc taggaccatg 70740 gctggttgtg tggccttgga aaaagcactt tcctcccagg atttcagctc cctcaactga 70800 catgtgaaga cattccagtg ctttggaggt caggatgaga ccatccattt agtgccagtg 70860 atggcaacgc aaaccagcat cgcctcttgg agaagcaact ggggagtgaa ggaggtcata 70920 ccatctcaca cgctttcctc atcccttggt actgattgat ttcaaggcaa taacacaagg 70980 gaaatagaag ctttctgcag aatatgtgca ttgcaggtgc tatttatcat ggtgaaaagt 71040 tggaagcaaa ctcaacactg aaatgagaaa caggctgaat cacagggtgg tgcatccact 71100 agcagggatc atatactgtg gtaattaaat ggtctgacat ccttgcagca agttaaactg 71160 tcacaggaaa caagagacaa agtatcatgg actacaataa aaataggaaa aggtgttgcc 71220 gcagagaaag actgtaatgc aggcgtccca tgctaagaat agttatactg tggccaggcg 71280 cggtctggga ggccgaggca ggtggatcac ctaaggtcag gagttcaaga ccagcctggc 71340 caacatggtg aaaccccatc tctactaaaa agccaggcgc agtggtgcat gcctgtagtc 71400 ccagctactc gggaggctga ggcatgagaa tcacttcaac ccaggaggcg gaggctgtgg 71460 tgagccaaga tcacaccact gtactccagc ctcggtgaca gagctagact ccgtgtcaaa 71520 aataaaaaaa gaatagttgt attgtttgat tggtaagatt cataggtgaa ttcttcctca 71580 ttttttcaga tgatagatac cactgtgact ttattttcat cccaaaaacg cagttaaatg 71640 acaaaactta agtaatggtc ttcgtggacc cttctgggat tgacatctga ctaagggctt 71700 tgtccagctt gctggatgcc ttgttgccca gaaaccactt gcattgcagc agttaccagg 71760 caatgcaggg atggagctgg gacctctgat ccctaggcca gtgtctctcc cattaccagt 71820 attcatccct caagctctct tgtaacttca atgtccttgc ctggcgctgg tcagagactg 71880 caatcaaagg taccaattct ggctgattta agcagaaaag ggatacattg aaaggttatt 71940 gactaactca tagacttagc aaaaggttat gcaacaaatc taagcggggg ggaagatagt 72000 gacccaatga ctcagagaaa ccaagggcag agccaaggtc acatcaggaa aatacggaga 72060 caggcggcag aaaatacgga gacagctact gtaggaccct cctgcccagg agccactggg 72120 ctctttttct ttctttctgg tattttgttt tttgtttttt gttttttgtt tttgagacag 72180 ggtctcactc tgtcacccag gctagaatgc agcagcacac tcatagctca ttgcaacctt 72240 gaacttctgg gttcaagtga tccgcccacc tcagcctcct aagtagctgg aattatgagt 72300 gtgcaccacc acacctgcct ctcactggac tcttcaccca actgctactg ccagaattct 72360 ctctctctgc atctgtttca cccctctaga cttgaggtcc cagttagaga caaccactta 72420 gtcaagtctt ggtcacacac ctgtgtcctt gctgccaggg agtagtagca tcctgagtgg 72480 ggggcttggc ttgcctcctg ccaagaccca cacagtgctg gaggaggagg ctcttcctaa 72540 aaagaagagt ttgaatagtg agtggccaaa caccttggtc actctatgtt cctactgccc 72600 cttcctcctc cttgttctct acctgcctag tcaaaccttc cagcatcact caatggtctc 72660 tcatagagaa tccacaccag cacaggcaac tagaggttct atatctttcc cagctccagg 72720 gccaggggct gggtttgcgt ataagtttgg ctcaagggtg ctccttgagc tccacttagg 72780 agttcagcac tgctaagtgc tcccacctgg tggcacaatt tcccacagcc tccaaaggcc 72840 ccaggcctag aaatgccctc ttccccatgc tgggtgtctt accctgttca ggctgctata 72900 acaaaatacc ataaactagg taccttgtaa ataacaaaaa tttgtttctc acattcctgg 72960 agcctggtaa gtccaagatc aaggccccag cagatttggt tttttggtaa ggcccattcc 73020 tcatacatgg agccttctag ctgtgtcctc acagggtgga agggacaagc aagctccctt 73080 gagtctcttt tgtaagggca ctaatcccat tcatgagaat tccaccccca tgacctaatc 73140 acctcccata aaagcttcac ctcttaatac catcaccttg ggggttagaa tttcaacata 73200 tgaattgtag gaggacacag acattcaaga ccatagcact gggctccccc aagagggtac 73260 ctgagcctaa gccctgggat cctacagtgc cagcaaatcc tgactcagta acaacactaa 73320 atctaggcct cttgggctgg atgcagcggc tcacgcctgt aatcccagca ctttgggagg 73380 ccgaggcagg cggatcacct gaggtcagga gttcaagacc aacctggcca acatagttaa 73440 accccatctc tactaaaaat acaaaaatta gctgggcatg gtggcaggcg cctgtaatcc 73500 cagctactcg ggaggctgag gcaggagaat cgcttgaacc caggaggcgg aggttgcagt 73560 gagccgagat tgtaccattg cactccagcc tgggcgacaa gagcgaaact ctgtctcaac 73620 aaaataaata aataaaaata aacctgtgcc tcttcaatcc attcttttac agggggccat 73680 gtggacacat ccctgttcca agtggcccac tttaaagagg aggggatgct cctggagaag 73740 ccagccaggg tggagctgca tcacatagtt ctggaaaacc ccagcttctc ccccttggga 73800 gtcctcctga aaatgatcca taatgccctg cgcttcattc ccgtcacctc tgtggtgttg 73860 ctttaccacc gcgtccatcc tgaggaagtc accttccacc tctacctgat cccaagtgac 73920 tgctccattc ggaaggtgac actaagagcc agaaaggctg ggccacggtg ggttgacggt 73980 aaacacaaaa tgcagccaga gagcccccta gagggtcttc aactgggacc aaggatgggc 74040 ctcttgtaca ggagaacctt gaagtgggca aacaattcgt cccctctcag tcttggactg 74100 gtctcctgcc tctccaaggt atccccctgc taggttcatt gatcaatcac tgatctttgg 74160 acagtttctt acggcccctg tccaggaatc aaaggattgg agtcctccat gcctaaaact 74220 gtagcctttc ctcattctaa cttccctaac aataacccta ttcctatcct gaagacatca 74280 aagaccaagt gcgggctgtc tctgtagcag tgcatggggg gagggaggca ggggtgatat 74340 agccccactc tgtccacaca aacacacaga tgcaccatcc cccctattca aaccacattc 74400 catggggcag aagcctccct taatcattgt tccctatgtt ctgtaggcca tagatgatct 74460 agaaatgaaa ttccagtttg tgcgaatcca caagccaccc ccgctgaccc cactttatat 74520 gggctgtcgt tacactgtgt ctgggtctgg ttcagggatg ctggaaatac tccccaaggt 74580 gagcagccca gtgtctgcct cttcactgcg aacagaagtc ctacctggga atgaatgcct 74640 gtggggcagc tctcagggct ttctcaggga caggataaat cgaggttgaa cagaaagtga 74700 gtgggattta gagaaagtgg gaatgcttcc aagactggga gaaagaaagt tcggaggagg 74760 gaggatcaca tgcaggatgt gaaggatgat ggggtttgac ttgggggcac tgggaagtgt 74820 ttgggctata ggtcatcaat gtccttcagt ccagaccacc aggaacaaac ctgtagttga 74880 acatactggc tttattaatc attgcagtgg cagaacacat attatgggga accatggggt 74940 gtctcagtaa agggtgtttg aaagaatctg ctataggatt tgggccttgg ttgcatacat 75000 ttgaggaaga tctaaggaaa cgaagatttt ctttagattg ggtgttggtt caggccacac 75060 ggagatttgt ggaagaacat ttcaggcaga aggaatgtgt gcagaggcat agaggtggga 75120 gcaagcttga cgcctccaga tgacaaaaaa gaggtcagta tagcaaaaga tgagtcagat 75180 tagaggcagg gtcttaccag gcctcatctg ccctgttggc cctggtatgc actttgcact 75240 ttgttctaag tgcaatggca agcttttgag ggtcataaac agctaataca agacctgatt 75300 tacagccagg cgcggtggct cacgcctgta atcccagcat tttgggtacc gaggtgggca 75360 gatcacttga ggtcaggagt tcgagaccag cctggccaac atggtgaaac cctgtctcta 75420 ctaaaaatac aaaaattagc tgggtgtggt ggtgtatgcc tgtaatgcca gctactcagg 75480 gggctgaggc aggagaatca cttgaacctg ggaggcagag gttgcagtga gccgagagtg 75540 caccactgca ctccagccta ggtgatagag caagactctg tctcaaaaaa aaaaaaaaaa 75600 aaaaaaaaaa gacctgattt actttcataa agatgcctct ggctgctggg tgagaacgaa 75660 tgataagcag gcaatagtga aagcagagag acaagtcaaa aggccattat gggccaggcg 75720 cagtgggtca cgcctataat cccagcactt tgggaggcca aggcgggtgg atcacctgag 75780 atcaggagtt tgagaccatc ctgaccaaca tagtgaaaac ctgtctctac taaaaataca 75840 aaaaattagt ggggcatggt gagaggcacc tgtaatgcca gctactcggg aggctgaggc 75900 aggagaattg cttgaacccg ggaggcagag gttgcagtga gccaagatca ccccccctgc 75960 actccagtgt gggctacaag agcaaaactc tgtctaaaaa aaaaaaaaaa aaaggccagg 76020 tgtggtggct cactcctgta atcgcagcac tatgggagtt caaggcaggc agatcaccta 76080 agatcaggag ttccagacca gcctggccaa gatggcaaaa cccagtctct actagaaata 76140 caaaatttag ccgggagggc cgggtgtggt ggctcaagcc tgtaatccca gcactttggg 76200 aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct aacacagtga 76260 aaccccatct ctaccaaaaa tacaaaactt tagccgagcg tggtggtggg tgcctgtagt 76320 cccagctact tgggaggctg aggtaggaga atggcatgaa cctgggaggc ggaggttgca 76380 gtgagccaag attgtgccac tgtgctccag cctgggcgac agagcgagac tccgtctcaa 76440 aaaaaaaaaa aaattagccg ggagtggtga tgggcacctg taaaccctga tactcaggag 76500 gctgaggcag gagaattgct tgaacctggg aggtggagtt tgcagtgagc cgagatcgtg 76560 ccactgcact ccaacctggt caacagagca agaccctgtc tcaaaaaaaa aaaaaaaaaa 76620 tgctgttgtg gttgggtaca gtgactttcg cctgtaattc catcactttg ggaggccata 76680 gtgggagaat cacttgaggc cagtagctca agaccaacgt gggccatgta gggagacctc 76740 atctctatga aaaatgaaaa aatgagtatg gtgatgcacg cctgtagtcc cagctactca 76800 ggaggctgag gcacttgagc ccaggagttc aagtccagcc tgggcaacat ggaaaaaccc 76860 catctctctt taaaaaaaaa agaaaaaaga gacaatagag agaaagagaa aaagacaaaa 76920 ggctgttgca atactgcaag ctacagttga aggtcacttg actaggaagg tagcgaggga 76980 gatggagaaa agtatatgaa ttcagggttg attttgaagg cagagccaac aggacctgct 77040 ggattgcctg tgtagggaag gacttaagca tgattccgag tttggagcct gcatgctcag 77100 tggatggtcc tatcctttac tgagtttggg gaaactggaa agggagcatg tttggaagac 77160 agtggggaat caagagttct gtcttagctg ggatttagga aaggggatgg atataaattt 77220 ttgtcaatcg acagtgggtg gagtttaggg tccattggga agacaggctt tgggctggag 77280 aaatgagggt ttggattcaa gatgggggag gtgaaggaag aaggatctgg cttaggatta 77340 tgagcaatac gtttaaggac aagccctgac ttccccgtga ggaaatcagg ggctcctcat 77400 ccttctgtgg aaaattggga aaagagcaat gtctgaaggt gatggttgat ttcagtacag 77460 ggatcatcag agtgtcctta tggagccaag cagggccaag aaggtgtcag gtggggaaag 77520 accttccatg caagatgaac agggtggcaa atcaactctt tgcctcttca ggaactggag 77580 ctttgctatc gaagccctgg agaagaccag ctgttctcgg agttctacgt tggccacttg 77640 ggatcaggga tcaggctgca agtgaaagac aagaaagatg agactctggt gtgggaggcc 77700 ttggtgaaac caggtaaatc caggacaatt ccagagaact cagaggtggg aggagaaaga 77760 aaatcaatga agttgctggg cacagtgctc acgcctgtaa tcccagaact ttgggaggcc 77820 gaggcaggca gatcacaagg tcaggagttc aagaccagcc tggccaatat ggtgaaaccc 77880 catctctacc aaaagtacaa aaaattagcc aggcgtggta gcgcgtgcct gtagtcccag 77940 ctactcagga ggctgaggta agcaatggcg tgaacctggg aggcggagct tgcagtgagc 78000 tgagatcgca ccactgcact ccagcctgga caaaagagca agactgcaat ccaaaaaaaa 78060 agaaaaagtc aatggaagaa agtctgtgta tgtgggaggg gggggggtgc agagtgcact 78120 atcacagttt attgatgaga agaaatagcc cacaggaggt aggaggtagg tggctttctc 78180 aagttacata acagttgtta tgactagaat ttaaacttga ctcaatctga ccctctaaaa 78240 tttatgttct ctcccatgca caatggtgtc ttcacagtgc ccttgaatta acagatgtcg 78300 acatacacaa agcttttagc taagtttctc atgatagttt gggtgtgagg tggaagaata 78360 agacctagat agtcatatgg tcgtatgggt taaaccgatt tggcaatttc agatactctc 78420 ccctgtatcg ctcagtattt ttgaggtagg caagaggaaa tcaaataata aattttctgg 78480 gatagagaaa atttatggac tcttagtcta gtaccaacaa tctaagaacc caaaccccca 78540 aacagtcctt ctaggacaac catccctagt cagctctcaa aaaatggtga atctacctct 78600 gagctgcaac aagctggctt cacagtgtag tggaaagagg atgggcttta ctgaatcctg 78660 gcttgggtgc aaccacagat ggagaactaa ccagccccag cttcagacct ggattcaaat 78720 cccactctgg cacttacttg ctgagtgacc ttgggaaaat tacttaattc atttgagtcc 78780 ctcgtctgta aaaagaggat cataatgact aatttgtagg aatgctggaa agattaaaca 78840 aagcgatgtc tatacatgcc tcatgcagta atccttaaca aatcgtagca atgttatctt 78900 tttcctcatg ttaagccaat ctctgcctct tatcagtctt aattctgtcc cttggaactc 78960 atagaaaaag tctatctcct tccccttaaa tctaaaaagc catttagatt ctcaaagact 79020 gtggtcatag cattcctgat gactcttgtt atgcaagtta gacatctcag ctccttccat 79080 cattcctctt ctattttggt ttgtaaattc tttgccatcc agtcacctcc ctcagtagat 79140 gctccagtgt gggaatgagt cagaattgac ccctggactg tccagtcagc agagagcatg 79200 taaacaccct cctttctcca cctacatgga cccggtgctc ctggtaccac aacctgaact 79260 ccctcccctg tgcccacctc cctgaactgt gagatcacac tgaacttaga gcctgctcag 79320 actctcaggc cttgtacatg cgagctgctg tgaacttccc ctttctctcc tttgcaaagc 79380 aatttttaag tcccttcatt tttgtcgttg tctgtttttt ctgttaaaat taatttgtta 79440 ttcccatgtc ctaagattgc ttgatcttga ttctagattc tgacatccaa atccagaacg 79500 tacacctttg gttggtgcca gtctagaagg ctaaacaaga acaaggaagt agatttctgc 79560 tcagacaagg aggtgctaaa tagttcaagc tatctagcaa ggggtttggt tagttcacca 79620 gggatacaat tcaggccaac acgggattaa aaccaagtct gggatgttat aggagaaatt 79680 gctatcgctc atttggatgt cttctattac tgtatgatca cagcttctga gcctcagttt 79740 ccccatatgc ataaggaagg gattcaccta aataacctct aaggaccctt ccagctctta 79800 ctccagattc taccagaatt tgtctgtggc ttttttctaa atcaaagaat ctcagaagtg 79860 agtgggagct tggaggtgta ccatgtgtgt taataactta gcactgtttc agatgcatct 79920 tcatctctgt tttcctccag gagatctcat gcctgcaact actctgatcc ctccagcccg 79980 cataggtcag taacagctct cagagaccac aggaggcggt ggcagttctg gggtactgtc 80040 tggttctggg ggaagggaag agagaaagga agggactcat cactgcaaac ctcacaaaga 80100 gtcaggccct ggagagcggt gcatggtcgc tccccatacc cccttgtcct atccacagag 80160 accctagaat tccttatctg acagcctcca atgtgcccag caggaacccc actcccgatc 80220 cccactctgg cagaatgcca agctagatgc tgccatgagg gaggacacag gtgcatttct 80280 ggaggcagaa ggaagcgagg gatgcagaga tgagagagga aagcactgaa ggtggggatt 80340 ggaggtggag gctgtggttg gagcccttct ttgttcgttg ccagccgtac cttcacctct 80400 ggatgccccg cagttgctgc actttgtgga ccagtatcga gagcagctga tagcccgagt 80460 gacatcggtg gaggttgtct tggacaaact gcatggacag gtgctgagcc aggagcagta 80520 cgagagggtg ctggctgaga acacgaggcc cagccagatg cggaagctgt tcagcttgag 80580 ccagtcctgg gaccggaagt gcaaagatgg actctaccaa gccctgaagg agacccatcc 80640 tcacctcatt atggaactct gggagaaggg cagcaaaaag ggactcctgc cactcagcag 80700 ctgaagtatc aacaccagcc cttgaccctt gagtcctggc tttggctgac ccttctttgg 80760 gtctcagttt ctttctctgc aaacaagttg ccatctggtt tgccttccag cactaaagta 80820 atggaacttt gatgatgcct ttgctgggca ttatgtgtcc atgccaggga tgccacaggg 80880 ggccccagtc caggtggcct aacagcatct cagggaatgt ccatctggag ctggcaagac 80940 ccctgcagac ctcatagagc ctcatctggt ggccacagca gccaagccta gagccctccg 81000 gatcccatcc aggcgcaaag aggaatagga gggacatgga accatttgcc tctggctgtg 81060 tcacagggtg agccccaaaa ttggggttca gcgtgggagg ccacgtggat tcttggcttt 81120 gtacaggaag atctacaaga gcaagccaac agagtaaagt ggaaggaagt ttattcagaa 81180 aataaaggag tatcacagct cttttagaat ttgtctagca ggctttccag tttttaccag 81240 aaaaccccta taaattaaaa attttttact taaatttaag aattaaaaaa atacaaaaaa 81300 gaaaaaatga aaataaagga ataagaagtt acctactcca taggcacagc agtcccgact 81360 ggctgctggt tggctatttt tgtggttatt tcttgatcgt gtgctaaaca aggagtggat 81420 tattcatgag ttttccagag aaaggggtgg gaattcctgg aactgaaggt tcttccccct 81480 ttcagactac ttagggaaac ttccagatgc tgccatggcg ttcgtaaact gtcatggcgc 81540 tggtgggaat gtcttttagt atgctaatgt attaaaatta gcatataatg agcagcgagg 81600 acaaccggag gtccctttca tcaccatctt ggtttgatgg gttttggctt ctttaccaca 81660 tcctgtttta tacgaggagt ctttgtgaaa ccagtcctgc caacctccta tcttatcccc 81720 acctcagaca ttacatactc gtccttaatc ttaagaggtt gtagaaggac agagatctat 81780 cttctgtcac tgcttcatgc tgaacagggg ccatcatccc tacctgctct ggaggcatag 81840 aaatgtcttc gtacctgatc gaatgactta tagggactcc atcttttctg cagtggcaca 81900 ggctggaaca ctggactaat tatttttctc tgaaactgtt gtagcctgga agacataaac 81960 ttgaaccgtc cgaacacctg atgaatataa caaatatttt taaaatcaaa tagcacctag 82020 tgataattca taaaatactt aagcccagtt ttaaagaaga gatcaaagcc aagcatggtg 82080 gctcatgcat gtaatcttaa cactttggga ggccaaggca ggaggatcac ttgagctcag 82140 gcttttgaga ctagcccggg caacatagac tttgtctctt taaaaaaata agaaagaaaa 82200 aaataagaga tcagatgcca tttggaatct aagtgaatag gctggaaaac tagtcttctg 82260 ttgtctgaaa tacaaagcca aacagatttt taataagaga gatttttatg agaacagaag 82320 aaaaaaaggt taatgctgga cacagtctat ccaggtatta gactcaaaag catctttagt 82380 tataaagcag gaaagtggtg gcaatctgac acgcctgtat cacaaggtat aagctttagc 82440 ttgcagggcc tcaggagaaa gctagtagta attttactgt atgcagatct ctaagctggg 82500 ggtgacacag gaggtgggca ggactggctt cacaagatgc aggtcacaaa gactccactg 82560 ataagatgcg gtaaagaagc tggccaaagc ccaccaaagc caagatggca atgaaaggga 82620 ctcctggttg tcctcactgc tcattatatg ctaattttca ccctcctgta ggaccagctc 82680 cagcagttgt gggacccagt gcaaaatgga aacacagggc ccctcgttca aaaatgataa 82740 gaatgtcaag acggtgacaa cagaacatga aatcaagcgc caggccctgc tgagctcagg 82800 gctctgtgtg accgcacagg tcacacaccc atgagactgg ccttgtcttt cgggtccttc 82860 acgccccacc cacccatctc ctttaccaca tgcagacctc cactctccac acaacccttc 82920 ctccaagagg acagtcaatg ggaagtaaga agatttaaaa gacaaggggc aaagaaaaat 82980 gcaatggtgt gtttccattg tagaaaacct ggtcgtggaa ttgcggattg ccctgccgcc 83040 cttgaaaatc aagatatggg cactagaaga tgttacaagt gtgggtccac agatcacgaa 83100 ataaccaagt gtaaggctaa agtagacccg cctcttggcg aatgtccttt tgcagaatgt 83160 tttgtctgtg gagaaatggg gcacctgtct agatcttgtc ctgataatcc caaaggactc 83220 tatgctgatg gtagtggttg ccaacttcat ggctctgtgg aacatttaaa gaaagattgc 83280 cctgaaagtg agaattcaga tcgaatggcc acagttggtc tgtgggcaaa gggaataagt 83340 gcagactatg aagacattgt ggatgcacca aaaccacaaa aacccaaaac aaaaatacct 83400 aaaggtgtta atttttgata acaactagta ctgtcattag ttaccacctc attgttactt 83460 tctaaaccag gcccacttca caagttacag ctgggctccc ttgtagccag gactatactg 83520 taaatatcag tatgatctgg gtgtggtcaa aaacaatttt caacagagag aaagaagaaa 83580 gaagagagag aaagaaaaag gaagagaaag aggaagagga aggaatgaag aaaaagagga 83640 aggaaggaag gaaagagaga aaggaaagag aaagaaagaa gaaagagaaa gagaaagaaa 83700 aagaaaagag aaagggaggg agggaaagaa ggaaggacct acgcgaggag gggctccgcc 83760 aaccctctaa gccccgcaac ccccgcatgc cccgcctcct tctgcagacc tccccgtccc 83820 tcatcccagc aagtcctcac gttagttgcc ggaagcctag gaaatgtatt cctaaatctg 83880 gagtcgtctc catggcagcg agccccgaag tggagccagt cttggagtag gagaaaggcc 83940 agacacaaac aagtgtgtgg ggatggggga gacacacatg catgtccgct ttggatccag 84000 attctctttt ttttttttag acggaggctc gctctgtcgc ccaggctgga gtgcagtggc 84060 gcaatctcag ctcactgcac tcactgcaac ctccacctcc cgggttcaaa cgattctcct 84120 gcctcagcct cccgagtagc tgggattaca ggcgcccacc accacacccg gataaatttt 84180 tgtattttta gtagagacgg ggtttcacca tgttggccag gctggtctcg aactcctggc 84240 atcaagtgat ccgcccgcct cggcctccta aagtgctggg attacaagtg tgagccaccg 84300 tgcccggcct ggatctggat tctaatagca acagctagaa ttctgaggga caggcagtga 84360 gctgggaatc aaattcttgg ctctaccacc cattagatgt gtgatcttga aacaggtttt 84420 cagatgtgta agaagggaat agcaaagctt gcgtcaaacg gttgtttgac taggtgagct 84480 catgagtggg aagttgttct gggcgagctg gtacacaaca ctaggcagac agtaaatctt 84540 agcttctttg agcctgaggc tgttgtaggt gccagtgata acagaggcca gagtatctat 84600 tctcaagcaa tttatattct agtggcaagg agatacgaaa tacgcaccca tgtctgtaat 84660 cccagcactt tgagaggcga aggcctgaag atctattgag accaggagtt tgagaccagg 84720 agttcaagac cagcctgggc aacatagcaa gaccccgtct cttaaaaaaa aaaatttttt 84780 ttcatttttt tggccagatg cagtggctca tgcctgtaat cctagcactt tgggatgtca 84840 aggcggacgg atcacctgag gttaagagtt taagagttca agaccagcct ggccaacatg 84900 gtgaaactct gtctctacaa aaatacaaaa attagctggg catgatggca ggtgcctgta 84960 accccagcta ctcgggaggc tgaggcagca gaattgcttg tacgtgggag gcacaggttg 85020 cagtgagccg agatcacgcc attgccctcc agcctgggca acagagcgag cctctatctc 85080 aaaaaaaaaa accagttttt ttaattagcc gggcatcgtg gtacatgcct gtattcccag 85140 atactctgga ggctgaggca gaagggtccc ttgagcctag gctccagtga gccataatca 85200 tgccactgca ctccagccct ggagacagag caagatcctg tctctaaata aaaataattg 85260 gctgggtgct tacctataat cctagcactt tgggaggctg aggccaatga attgcttgag 85320 tccaggagtt tgagaccagc ctgggcagca tagtgagaca ctgtctctat ttaaaaaaaa 85380 aaaagaagaa gaagaataga agaaagatct ttttgaatag agatagatgg caacttcctc 85440 agaatggatg tgacggttga acagcacatg gagatgatgg gaatgagaag tgcagggcag 85500 cagggatcat tctcaacaat ggtggaaaat tggagtcttc caggatcact gtgttcagac 85560 taatccaccc tacatgaagt tttctttctc tttaaagtgg gccacttgga acagggaccc 85620 ctgtcaaaga atgcattcaa gaatcatgga tttggctggg cgaggtgact cacgcctgta 85680 atctcaacac tttgggaggc tgagacggat gaatcacctc aagtcaggag tttgagaccg 85740 gcctggccaa catagcaaaa ccccatctct actaaaaata caaacaaaat gagctgggcg 85800 tgctggctca tgcagtgagc cgagatcacg ccattgtact ccagtgtagg caacaagagt 85860 gagactccat ctcagaaaaa cggaagaatc atggaatgct tccaccaggc actctgatgg 85920 atggtgggag ccctgggctg agcttaatgc ctttcctgaa ggagtcactc tcccaaggag 85980 taggatttgc caggtgctag aatccgatag aggccatgag caatacttgt ctcaatggtg 86040 ggataactac agatttaaac tttgagccat gcacacactc tggtccaggc cccagtcctg 86100 gagcaagaag gaagcagggg aaggaagaag gtcctctgga acctgtgatt acctgtgttg 86160 tgtcaggttg tatgattgtg gggaccccat tgagtgctac tcacaggtta gccccaggca 86220 ggtaggaatg agaaggagga gaataggaga agaagaggaa aggctagaca ggagcatgac 86280 tgttggagag aacttgaagg gaagtctagc agtagtatgg acgctgtgct attggtcagg 86340 agtcctgctg tgttcttggt ttcctagcaa ctgctgttaa gcaggtagtt gctaggcaat 86400 gcggggccat ctaggcggtt aggtaaggac ctttgtctgc gtgggcatga atcagaccat 86460 cacagagtat tgttaatcct gaaaatggcc actgggtcca ttacttagat ttttttggcc 86520 ttgttgcaca gaaactaata taaccatggt gcacaagttc tggtcaaatg agaaagaagt 86580 caccccatga tctcattatc caaacaatac aaaaattctg tctttcctct ttccattcca 86640 gtctttctct gcaggaacac attttcatct tgttatcatc ctagtacatg gggttttaaa 86700 ccttgttttt ttgactttga tttagatcat cacaagcatc cctcaagcta cttaagagtt 86760 ttcaggataa tcttcccctg catgactcct tgcagtagat atatcattta cttaactatt 86820 ctcctcttgt tctgacattg agtttgcttc caattttttg ccacaataag cagtactgca 86880 atacacgtct ttctgcagag agcttttcgt tatcctttag atcactgtct tgaagtcaat 86940 actaagggat agtatcaaag ggcataaata gtctcatgac tttctatatt ttgcaatagc 87000 ttccctcatg ctgggtgcag tggctcaagc ctgtaattct cacactttaa gaagccaaga 87060 aggaggatca cttgaggcca ggagttcaag gccagcctag gcaacatagc aagactctgt 87120 ctctacaaaa caaatttatt ttaattttta tttttctttt gagacagagt ctcgttcggt 87180 cacccaggct ggagtgcagt ggcgcaatct cagctcactg caacctcggc ctcctaggtt 87240 caagtgattc tcctgcctca gccatctcag tagctgggat tacaggtgca caccaccata 87300 cccagctaat tttttttttt ttctgagacc gagtctcgct ctgtccccca ggctggagtg 87360 caatgacacg atgtcagctc accgcaacct ccgcctccat ggttcaagca attctcctgc 87420 ctcagcctcc tgagtagcta ggattacagg cgtgcgccac cacgcctggc taatttttgt 87480 atttttagta gagatggggt ttcaccatgt tggccaggct ggtctcgaac tcctgacctt 87540 gtgatccgcc cacctaagcc tcccgaagtg ctgggattac aggtgtgagc cactgcgcct 87600 ggccgctggc taattttttt cttttttgag acggagtctc gctttgttgc ccaggttggg 87660 gtgcagtggc gcaatctcgg ctcactacaa cccccgcctc ccgagttcaa gcgattctcc 87720 tgcctcagcc tcctgagttt ccagtaactg ggactacagg cacgtgccac catgcctgga 87780 taattttttg tatctttaat agagacgggg tttcactgtg ttagccagga tggactcgat 87840 ctcctgacct cgtgatctac ccgcctcggc ctcccagtgg gattacaggc gtgagccaca 87900 acacccagcc aatttttgta gttttagtag aggcagggtt tcaccatgtt gcccaggctc 87960 gtctggaact cctggactca agcaatctgc ccacctcggc ctcccagagt gttggagtta 88020 caggcttgag ccaccatgcc cggctgaatt tcttgattct ttaatggctg gtttttgttt 88080 tcttttgatt ttagaaggcc ttaatttgaa gacctttaat ttaatttaat ttaattaatt 88140 taatttattt gaagacagtt tgagggagaa caaaatgtcc tcacagttat tttttcatta 88200 ttttagttga atatatctat aatatagttt atgattacaa ccatttttaa gtgtaagcac 88260 attcacctcg acgtgcaacg atcaccactg tccatctcca gaattttttc atcttcctta 88320 attgaaatta tgtacccatt aaataataat tccccattcc tgcctccttt cagcccctgg 88380 gaaccactat tctactttct gtctctttaa atttgaccac tctagaaacc ttatataagt 88440 ggaatcatac aatcattgtc cttttgtgct tgcttctttc acttatcaca aagtgttcaa 88500 ggtttatcca tggtattgca tgtgtcagaa tttttttttt tttttttttt tttttgagat 88560 ggagttttgc tcttgttgcc caggctggag tgcagtggtg cagtctcagc ttactgcaac 88620 ctccaccacc cgggctcaag ccattctcct gcctcagcct cccaagtagc tgggattaca 88680 ggtgtcgacc atcatgccca gctaattttt tttttttttt gtatttttag tagagctggg 88740 gtttcaccat gttggccagg ctggtctcaa actcctgacc tcaggtgatc ctcccgcctc 88800 ggcttcccaa agtgctggga ttacaggcgg gagccacctc acccagccag aatttccttc 88860 cttttaatgt tggaaaacat cccattgtat gtctatgcca cattttggtt atctgttcat 88920 ctgttgatgg acacgtgtgt tgcttccacc tttcggctat tgtgagtaat gctgccatga 88980 acataggtgt acagatatct ctttgaaacc ttgttttcaa ctcttttggg tatataccca 89040 gcaggggaat tgctggatta tcctcacagt tcttaaccat ttagtgtggg tcagcttctg 89100 ggtggataga gtaatggaag gagaagaatg atgttgctga attggctgtt tgatgatttt 89160 tttttttttt tttgagacag acttttgttc ttgttgccca gactggagtg caatggtgcc 89220 atctcggctc actgcaacct ccacctctta agttccagcg attctcctgc ctcagcctct 89280 cgggtagccg tgattacagg cgcccacccc catgcccagc taattttgta ttttgaatag 89340 agacggggtt tcaccatgtt ggtcaggctg gtctcgaact cctgacctca ggcgatccac 89400 cctcatccgc ctcccaaagt gctgggatta cagacatgag ccaccacacc tggccctgtt 89460 tgatgatttt tgttctatag gagcggcaca aagagaagag gacagaacaa aagtaaccaa 89520 atctattctc tctgaggaag atcatatgta gagagcagag gtctgtgtgt gtgaaagaaa 89580 acggggaaga ggatgaaaat gtccagagtg gtgagctctg ctaggccctg gaatgaagct 89640 gagatgctgg aagcgctaag ttctctctag tgggagaaaa cccactgtgg aaagctaatc 89700 atttgatgta gaacctcacc aacgaccagg aagaagcaat cagtgcccag ctaggctaaa 89760 ctgggcctgc gatagggtgt gagtagggcc agcttcctgg aagtgcaacc tgtgcaatgg 89820 cagtaggccc ccatgcttgg ggtttaatgc gtggcagtca gtgccttgaa actcttaata 89880 attgtatctt tgaatttgtg ctttgtaagt gaagtttgat gggacctgga gcatgcatag 89940 gggcaggaag ggagggtagg ggggcttgga gccacagctc atgtggaatc ccacctgcca 90000 ggtgccacct cacagcgccc agacaggttc ttagcttccc actgtctgcc ccgtggagcc 90060 ccaggcttta ctcagcctcc ccttctctgc cttgctcagc aaccattgct acccctaacc 90120 ccagcagggg attgagtgga gcacggggat gatcagcatt ggatgtgagc tcccagaagc 90180 ctctcagggt ggggcaggca gctgaccctg cattgggcta gaagtgccac agcaccttca 90240 gcaggtgact aggcttatcc aggcacatcc tagtgtagag gttgcaatac acttgcaggt 90300 ttcccatcaa ctgtgggttt gggtagagga cctgtggaaa ggggagattc ctggcttgac 90360 ttcccttctc ccagccaggg cacgcatata aattagaagt tggcaggaga ggaattcggc 90420 agccatggtt tccaggataa caaatagcta tagcccaggc agcagagatt cccagaaggt 90480 ccttctctcc cacccaccca ctgacacata caccccacca ccaccaccac caccaaagcc 90540 tgtttcctca gctgggcatg aacacttcga gctaaaagtg gcctttgtga attttgcctc 90600 ttgctctgtg tacacattct aaagggaaca agtctgtttt gctgcagggt gaggcccaga 90660 atgcggcctg ggaaaggcta acagggcaca gtctttgatg gtgagcaagc ctggggtggg 90720 tagctgaggg gattgggagg tatacctctt acccttgggt cagagagatg tgggttagaa 90780 ccccagcagt cagtgtttta ttctattcta ttctattcta ttctattcta atttttttga 90840 gacagagtct cagtctatca cccaggctgg tgtgcagtgg tgtgatctca gctcactgca 90900 acctccgcct cctgggttca agcgattctc gtgcctcagc ctcccaagta gctgggatta 90960 caggcatgtg ccactatgcc cggctaattt ttgtattctt agtagagagg gggttttgcc 91020 atgttggcca gactggtctc gaactcctga cctcaggtga tccacctacc tcagcctccc 91080 aaagtgctgg gattacagat gtgagccacc acacccggcc cccagcagtc agtattagtg 91140 gcagatttta gattccttca cctctcagcc tcagtttcct cattggaaca ttggagaaaa 91200 tactggtgcc taatttaggg ggtcatgaaa attaagtgag accatctctg cagaaatgtt 91260 gagcacagtt cccagcacac agtcggtggc cagtgattgg ctgctgctgc tgctgctgct 91320 atcatccatg tgtgtcctag gattcctcat agagggggtc accaacatga tcaactggcc 91380 attgtccaga tccaggcttt ggcctggcag actcaaatct ctgattccta ttgactcaga 91440 cagtgtatct ttcttccaga ggcaaatacc cctaaatgga tgaaggaaac ctatcaaagt 91500 atagcaccgg agaaaagatc cccctagaag aggcctctct cacctctgag aggggtatgt 91560 actctgggca ggtcccagaa gcaggagcta aaggtagcct ttgtccattc agggtttttt 91620 tttggtggat tctggctcca ggccctggac tcatgaggtt tttcctggaa cgctctggtg 91680 tcgagcagag tgtgaagttg tagcttggga tcttccagta tctgtcctgt cagggatgtc 91740 tagtggtcct gaagtccctg ggctgggacg aagttagctc ttgttgctgg gcataggcat 91800 agcttgcacc aggatgaggc catggagtca gccaagctct ctcaattcct gacagacaga 91860 gcccccaggg gtcacctggg ctgggaccag cactgttttg tgatatattc atttctgtac 91920 aggtagacca ggtaaaaaat aataaggtgt cctagagaca cctgggcctc tgctgtctga 91980 gctccggatt tagcctcaca tcttctcaat atgcaaagga ccctgcagtt actaaagcat 92040 ttttaacacc tgcactcctc actgacaccc agctcctccc caataagcct ttgcatggtt 92100 ctgcccgaga ccctcttggt tactgagact tccccactac ctactcccac ttaagctgca 92160 taggtaggtt tgggtgactt tctcaatgcc acagccttac ataaagctct tccatcccac 92220 tctacacacc cttccattcc accttgaaaa ccctctcatt gcaccctaca caccctttca 92280 ttctagcctc acatccttcc aatcatccat cacacaattc catttcaccc ttcacaccct 92340 tccattccac cttgaaaacc ctcccattgc accctacaca ccctttcatt ctagcctcac 92400 atccttccat tcatgcatca cacaattcca tttcaccctt cacacccttc cagtccatta 92460 ttcattcaca ccatttcatt ccattcttca cgccattcca ttacaacctt catattcttc 92520 cactccacct tacacacctt atattccacc cttgacactc ttccagtcca ctacatgtcc 92580 ttccgtttct ctcttcacat cttccatttc accctaagca acctcccatt tcacccttca 92640 cacccttcaa ttccaccctt cacacccctc catgacatcc ttcaagctct tccgtgccac 92700 ccttaacatc ctctcattcc acttttcact ctctggtcct tccatgtcct tccattctgc 92760 cttctgcaac cttccattcc attcttcaca ttctttcact ccacctaaac catccttcca 92820 ttccaccctc cacacctttc aatttcaccc ttcatatact tccacccggc ttggcgcagt 92880 ggctcatgcc agtaatccca gcagtttggg aggccgaggc aggtggatca cttgaggtca 92940 ggaagttcaa aaccagcttg gcccatgtgg tgaaaccctg tctctactgc aaatacaaaa 93000 aattagctgg gcatgatggt gtgggcctgt aatcccaggt actcagaggc tgaggcagga 93060 gaatcgcttg aacttgggag gcagaggttg cagtgagccg agaccgctcc actgcactcc 93120 agcctggaca acagagcaag accccgtctc aaaaaaaaaa aaaaaaggaa ctgcaccatt 93180 ttacattctc aacatcacgg atgagggttc catcttctcc tcatccagcc aactggaagc 93240 ccttttagct gaggtggtta agctgtgaga accagagttt gctgctgccg ttattcactg 93300 catcattcat tcagcaaaaa cctgagtacc taccactacc tagtaggctc tggccatgca 93360 agaagtccct ggcaacgctg taggatagag aggaacagca ccctggttac agcgtttcag 93420 catcctggcc ttggttcacc ccggggcctt tcacttgcag gaggcaggga gttctctgat 93480 ggcagtttga gttggatttg ctgcaactaa caaagccggc cagtggacct ctccaatcta 93540 ctgggactca aataaatcca agtgattgtt ctccctccat tccaggaagg aacaccagcc 93600 agccgtggaa cctcaggtgc aacagagacg ccaggagata ctagtgccca gcagcctgcg 93660 gcagtaccaa tgaagccaga gagggcttgg tggatgacaa ggaggcctga gtagaccgca 93720 ggtgggtctg agaaatgggc ttaggtgagg caggtctttg aaggatttgt tcttaatcat 93780 atgcgagatg ctcaaaaggc tggatgcctg cttttgtggg tgaagagcaa gaagagaaaa 93840 caggttgtac acatacagat gcagatggag agacagagaa aaaaaaggaa gaaggcagag 93900 aaatgcacca attcttgagc tgtattatct ctggaccttg ggattgtggg aggctttatt 93960 ttactactga ttttgcctac actgttttct caatttctag ttttctacaa agatgatgtg 94020 ttagcttttt cacgcattaa gattaaaatt taaaacagac cacacaataa atatttcttt 94080 taaaaggaga aatatgccct ggggaagctc cccacgcagc tgagagcctg gctggttgca 94140 tttcggcttc ttaggtgcca agcctactag tgccccgggg tcccagccag cggctcgcgt 94200 ttcctccccc ttgcgccgcc agtctggttg ccatggagac aaaccttgct gcgcagagac 94260 tgccgagccg caggcagcgg agctggaacc cggcgagcca gcctctgcaa caggctgggg 94320 gcggaaggag gagccaggcg aagcggcgcc tcagctgaga ggaccggcgg accctgcaga 94380 ggccccctgc ccctctggct ccgcccccac ccgggtcgct agaaatacag ccgtagcccc 94440 gcccaccgcc cactgcgctc tgacccagac ccggctgacc cacctacccg cgatcctgcc 94500 catggctgac gggctctttc ggcgcagacc ctggggtctc gagcagattc gcccggaccc 94560 cgagtccgaa ggcctgtttg acaagcctcc cccggaagac cctcccgctg cccgcgggcc 94620 caggtcggcg tcggccgcgg gcaagaaggc tggtcggcgc gcgggcggga gggcgcaggg 94680 gggccgcgcc gggcagcccc cgaaggccgc atcgcgcccc ccgcccaaga aggaggcgcc 94740 tccactggac gagggctgct atctcgacca ttttccgcac ctctccatct tcatctacgc 94800 agccatcgcc ttctccatca cctcctgcat ctttacctat atccatttac agcttgcctg 94860 agtggccagc gcgggacggg gtgggcgcag gaccgagcgg ggagggaaag ggaaaacggg 94920 gctcggcatt ttgtgtttta gaacagcgct gcaccccctt catgtagctt tcgatgcttg 94980 tttctttccg tctttgttgt cactatcttt gtctatcagt acgaaagtac aaagtagctg 95040 ccggcaatga aataggggtg ctgtttgcac ctgcaggtta ggggtggagg cgtttagaat 95100 tttggggtgt gattgagccc cgtttataat tagaatgccc ctggacccct accactctgt 95160 gacgtggggg cacgcgcagg gatcccatca ttttgtgttt ggggagctca gagtgcgccc 95220 aatcttggaa tctttaaggg atgagccaga cccagacccg cggccttcta gagagggtcc 95280 ggcagggagg gtcggcgccc tggcccgggg tgggccggag ccctgtgatg ctgcatcgcc 95340 cccaggagga gccagctgtg ccccagagtt ggcgcggccg agagaggaca agagcgcgca 95400 gcaggcgaag ctggagggcg ggactcggta agtggcgttc gtcggggtgt cgtgctgcgc 95460 ccccaggggc tccggctgac cacgactgtg tgtttttcct gccttagact ttgttgtcgc 95520 tgcccggagg agtcgagact ggtacccgga ggagctgtct caccaggaga ccacgtcctg 95580 gaagtgtccg ggactcgcgg gacctgtggc tgcagacccc gccggcacgc aggcccagag 95640 ctggcgcact cctgaggatg agactctggg ggccctagcc ggggtccacg ggagggctgt 95700 ccttggggac tctaggatgg cttcgttctg gcccggctca cttctggagc tgtgagaccc 95760 aagacaaaag gggctgaggg atttctcatt gacaagagtt cgtgcgggaa aaccacctga 95820 tccctaggga tttgtcatct taagactcaa aaggcttaat accaggaacc accttggcaa 95880 gatatttacc caccggccat ctctgtttac tcatgaatgt taaatgttaa aacgcagcgc 95940 tctaaccctg catattattt acttgcaaat gtctgtaatc tgtaattgtg atgcctctga 96000 tggaataaat tatctttttc agtctcctct atattcagtt ctccattaat ttccaactct 96060 tctaacttct tctgccaaac ttcctagtct gctccacagc atccccactg tccctccaaa 96120 acagacttaa tcctgtgacc cagcattcca aaatatagcc tctccctcca ctaaatagct 96180 acaaggccct tcaatggcat ctttctccac gtcaggtcct cacaacatgc cagtatccga 96240 gtctaccatc ccttatggca gatcagccac tatcccattg tggcattacc tactgcccct 96300 cagtatttcg tttccacttg cctccatttt gaagaccgag tacggccccc cagctaagct 96360 ctggaatttt ccccctgggt ctttacaaac gccctgtttt ccaatatcag ggccttgcca 96420 tttctcagtt ctgacttccc ataatcccat atcactctgt gccttgatat cttgcccccg 96480 gacacttgtt tcggtgccac aagtatgaca tccacggttc cccaatgtca gcctgtgctt 96540 tacaatcaca ctttactaca acaattcccc cacagcaatc ggtggcacta tttcccaaaa 96600 ttaggtctcc aggccgggct cgttggctca cgcctgtaat cccacactt 96649 11 6531 DNA Homo sapiens 11 gccccagggc ctggagaggt ctgaagaaac ctgggagcca gcagcccggg gctccactct 60 gggttctgaa agcccattcc ctgctctgcg gctcctccca ccccacctct tctcagcctt 120 gcagctcaag ggttgatctc aggagtccag gacccaggag agggaagaat ctgaggaaca 180 cagaacagtg agcgttgccc acaccccatc tcccgtcacc acatctcccc tcaccctcac 240 cctccctgcc tggccctgga ccccatccca ggacctccct atcagctgac ttcttccagt 300 gtcttgcagg cccctctggg ctcctccctc ccctggcttt tcctaccact ccccctctat 360 cggcgtctat ctgtaggtgc cctgggattt ataaaactgg gttccgaatg ctgaataaga 420 gacggtaaga gccaaggcaa aggacagcac tgttctctgc ctgcctgata ccctcaccac 480 ctgggaacat cccccagaca ccctcttaac tccgggacag agatggctgg cggagcctgg 540 ggccgcctgg cctgttactt ggagttcctg aagaaggagg agctgaagga gttccagctt 600 ctgctcgcca ataaagcgca ctccaggagc tcttcgggtg agacacccgc tcagccagag 660 aagacgagtg gcatggaggt ggcctcgtac ctggtggctc agtatgggga gcagcgggcc 720 tgggacctag ccctccatac ctgggagcag atggggctga ggtcactgtg cgcccaagcc 780 caggaagggg caggccactc tccctcattc ccctacagcc caagtgaacc ccacctgggg 840 tctcccagcc aacccacctc caccgcagtg ctaatgccct ggatccatga attgccggcg 900 gggtgcaccc agggctcaga gagaagggtt ttgagacagc tgcctgacac atctggacgc 960 cgctggagag aaatctctgc ctcactcctc taccaagctc ttccaagctc cccagaccat 1020 gagtctccaa gccaggagtc acccaacgcc cccacatcca cagcagtgct ggggagctgg 1080 ggatccccac ctcagcccag cctagcaccc agagagcagg aggctcctgg gacccaatgg 1140 cctctggatg aaacgtcagg aatttactac acagaaatca gagaaagaga gagagagaaa 1200 tcagagaaag gcaggccccc atgggcagcg gtggtaggaa cgcccccaca ggcgcacacc 1260 agcctacagc cccaccacca cccatgggag ccttctgtga gagagagcct ctgttccaca 1320 tggccctgga aaaatgagga ttttaaccaa aaattcacac agctgctact tctacaaaga 1380 cctcacccca gaagccaaga tcccctggtc aagagaagct ggcctgatta tgtggaggag 1440 aatcgaggac atttaattga gatcagagac ttatttggcc caggcctgga tacccaagaa 1500 cctcgcatag tcatactgca gggggctgct ggaattggga agtcaacact ggccaggcag 1560 gtgaaggaag cctgggggag aggccagctg tatggggacc gcttccagca tgtcttctac 1620 ttcagctgca gagagctggc ccagtccaag gtggtgagtc tcgctgagct catcggaaaa 1680 gatgggacag ccactccggc tcccattaga cagatcctgt ctaggccaga gcggctgctc 1740 ttcatcctcg atggtgtaga tgagccagga tgggtcttgc aggagccgag ttctgagctc 1800 tgtctgcact ggagccagcc acagccggcg gatgcactgc tgggcagttt gctggggaaa 1860 actatacttc ccgaggcatc cttcctgatc acggctcgga ccacagctct gcagaacctc 1920 attccttctt tggagcaggc acgttgggta gaggtcctgg ggttctctga gtccagcagg 1980 aaggaatatt tctacagata tttcacagat gaaaggcaag caattagagc ctttaggttg 2040 gtcaaatcaa acaaagagct ctgggccctg tgtcttgtgc cctgggtgtc ctggctggcc 2100 tgcacttgcc tgatgcagca gatgaagcgg aaggaaaaac tcacactgac ttccaagacc 2160 accacaaccc tctgtctaca ttaccttgcc caggctctcc aagctcagcc attgggaccc 2220 cagctcagag acctctgctc tctggctgct gagggcatct ggcaaaaaaa gacccttttc 2280 agtccagatg acctcaggaa gcatgggtta gatggggcca tcatctccac cttcttgaag 2340 atgggtattc ttcaagagca ccccatccct ctgagctaca gcttcattca cctctgtttc 2400 caagagttct ttgcagcaat gtcctatgtc ttggaggatg agaaggggag aggtaaacat 2460 tctaattgca tcatagattt ggaaaagacg ctagaagcat atggaataca tggcctgttt 2520 ggggcatcaa ccacacgttt cctattgggc ctgttaagtg atgaggggga gagagagatg 2580 gagaacatct ttcactgccg gctgtctcag gggaggaacc tgatgcagtg ggtcccgtcc 2640 ctgcagctgc tgctgcagcc acactctctg gagtccctcc actgcttgta cgagactcgg 2700 aacaaaacgt tcctgacaca agtgatggcc catttcgaag aaatgggcat gtgtgtagaa 2760 acagacatgg agctcttagt gtgcactttc tgcattaaat tcagccgcca cgtgaagaag 2820 cttcagctga ttgagggcag gcagcacaga tcaacatgga gccccaccat ggtagtcctg 2880 ttcaggtggg tcccagtcac agatgcctat tggcagattc tcttctccgt cctcaaggtc 2940 accagaaacc tgaaggagct ggacctaagt ggaaactcgc tgagccactc tgcagtgaag 3000 agtctttgta agaccctgag acgccctcgc tgcctcctgg agaccctgcg gttggctggc 3060 tgtggcctca cagctgagga ctgcaaggac cttgcctttg ggctgagagc caaccagacc 3120 ctgaccgagc tggacctgag cttcaatgtg ctcacggatg ctggagccaa acacctttgc 3180 cagagactga gacagccgag ctgcaagcta cagcgactgc agctggtcag ctgtggcctc 3240 acgtctgact gctgccagga cctggcctct gtgcttagtg ccagccccag cctgaaggag 3300 ctagacctgc agcagaacaa cctggatgac gttggcgtgc gactgctctg tgaggggctc 3360 aggcatcctg cctgcaaact catacgcctg gggctggacc agacaactct gagtgatgag 3420 atgaggcagg aactgagggc cctggagcag gagaaacctc agctgctcat cttcagcaga 3480 cggaaaccaa gtgtgatgac ccctactgag ggcctggata cgggagagat gagtaatagc 3540 acatcctcac tcaagcggca gagactcgga tcagagaggg cggcttccca tgttgctcag 3600 gctaatctca aactcctgga cgtgagcaag atcttcccaa ttgctgagat tgcagcagag 3660 gaaagctccc cagaggtagt accggtggaa ctcttgtgca tgccttctcc tgcctctcaa 3720 ggggacctgc atacgaagcc tttggggact gacgatgact tctggggccc cacggggcct 3780 gtggctactg aggtagttga caaagaaaag aacttgtacc gagttcactt ccctgtagct 3840 ggctcctacc gctggcccaa cacgggtctc tgctttgtga tgagagaagc ggtgaccgtt 3900 gagattgaat tctgtgtgtg ggaccagttc ctgggtgaga tcaacccaca gcacagctgg 3960 atggtggcag ggcctctgct ggacatcaag gctgagcctg gagctgtgga agctgtgcac 4020 ctccctcact ttgtggctct ccaagggggc catgtggaca catccctgtt ccaagtggcc 4080 cactttaaag aggaggggat gctcctggag aagccagcca gggtggagct gcatcacata 4140 gttctggaaa accccagctt ctcccccttg ggagtcctcc tgaaaatgat ccataatgcc 4200 ctgcgcttca ttcccgtcac ctctgtggtg ttgctttacc accgcgtcca tcctgaggaa 4260 gtcaccttcc acctctacct gatcccaagt gactgctcca ttcggaaggc catagatgat 4320 ctagaaatga aattccagtt tgtgcgaatc cacaagccac ccccgctgac cccactttat 4380 atgggctgtc gttacactgt gtctgggtct ggttcaggga tgctggaaat actccccaag 4440 gaactggagc tttgctatcg aagccctgga gaagaccagc tgttctcgga gttctacgtt 4500 ggccacttgg gatcagggat caggctgcaa gtgaaagaca agaaagatga gactctggtg 4560 tgggaggcct tggtgaaacc aggagatctc atgcctgcaa ctactctgat ccctccagcc 4620 cgcatagccg taccttcacc tctggatgcc ccgcagttgc tgcactttgt ggaccagtat 4680 cgagagcagc tgatagcccg agtgacatcg gtggaggttg tcttggacaa actgcatgga 4740 caggtgctga gccaggagca gtacgagagg gtgctggctg agaacacgag gcccagccag 4800 atgcggaagc tgttcagctt gagccagtcc tgggaccgga agtgcaaaga tggactctac 4860 caagccctga aggagaccca tcctcacctc attatggaac tctgggagaa gggcagcaaa 4920 aagggactcc tgccactcag cagctgaagt atcaacacca gcccttgacc cttgagtcct 4980 ggctttggct gacccttctt tgggtctcag tttctttctc tgcaaacaag ttgccatctg 5040 gtttgccttc cagcactaaa gtaatggaac tttgatgatg cctttgctgg gcattatgtg 5100 tccatgccag ggatgccaca gggggcccca gtccaggtgg cctaacagca tctcagggaa 5160 tgtccatctg gagctggcaa gacccctgca gacctcatag agcctcatct ggtggccaca 5220 gcagccaagc ctagagccct ccggatccca tccaggcgca aagaggaata ggagggacat 5280 ggaaccattt gcctctggct gtgtcacagg gtgagcccca aaattggggt tcagcgtggg 5340 aggccacgtg gattcttggc tttgtacagg aagatctaca agagcaagcc aacagagtaa 5400 agtggaagga agtttattca gaaaataaag gagtatcaca gctcttttag aatttgtcta 5460 gcaggctttc cagtttttac cagaaaaccc ctataaatta aaaatttttt acttaaattt 5520 aagaattaaa aaaatacaaa aaagaaaaaa tgaaaataaa ggaataagaa gttacctacg 5580 aaggaacacc agccagccgt ggaacctcag gtgcaacaga gacgccagga gatactagtg 5640 cccagcagcc tgcggcagta ccaatgaagc cagagagggc ttggtggatg acaaggaggc 5700 ctgagtagac cgcaggtggg tctgagaaat gggcttaggt gaggcaggtc tttgaaggat 5760 ttgttcttaa tcatatgcga gatgctcaaa aggctggatg cctgcttttg tgggtgaaga 5820 gcaagaagag aaaacaggtt gtacacatac agatgcagat ggagagacag agaaaaaaaa 5880 ggaagaaggc agagaaatgc accaattctt gagctgtatt atctctggac cttgggattg 5940 tgggaggctt tattttacta ctgattttgc ctacactgtt ttctcaattt ctagttttct 6000 acaaagatga tgtgttagct ttttcacgca ttaagattaa aatttaaaac agaccacaca 6060 ataaatattt cttttaaaag gagaaatatg ccctggggaa gctccccacg cagctgagag 6120 cctggctggt tgcatttcgg cttcttaggt gccaagccta ctagtgcccc ggggtcccag 6180 ccagcggctc gcgtttcctc ccccttgcgc cgccagtctg gttgccatgg agacaaacct 6240 tgctgcgcag agactgccga gccgcaggca gcggagctgg aacccggcga gccagcctct 6300 gcaacaggct gggggcggaa ggaggagcca ggcgaagcgg cgcctcagct gagaggaccg 6360 gcggaccctg cagaggcccc ctgcccctct ggctccgccc ccacccgggt cgctagaaat 6420 acagccgtag ccccgcccac cgcccactgc gctctgaccc agacccggct gacccaccta 6480 cccgcgatcc tgcccatggc tgacgggctc tttcggcgca gaccctgggg t 6531 12 5100 DNA Homo sapiens CDS (160)...(4449) 12 tgggagccag cagcccgggg gctccactct gggttctgaa agcccattcc ctgctctgcg 60 gctcctccca ccccacctct tctcagcctt gcagctcaag ggttgatctc aggagtccag 120 gacccaggag agggaagaat ctgaggaaca cagaacaag atg gct ggc gga gcc 174 Met Ala Gly Gly Ala 1 5 tgg ggc cgc ctg gcc tgt tac ttg gag ttc ctg aag aag gag gag ctg 222 Trp Gly Arg Leu Ala Cys Tyr Leu Glu Phe Leu Lys Lys Glu Glu Leu 10 15 20 aag gag ttc cag ctt ctg ctc gcc aat aaa gcg cac tcc agg agc tct 270 Lys Glu Phe Gln Leu Leu Leu Ala Asn Lys Ala His Ser Arg Ser Ser 25 30 35 tcg ggt gag aca ccc gct cag cca gag aag acg agt ggc atg gag gtg 318 Ser Gly Glu Thr Pro Ala Gln Pro Glu Lys Thr Ser Gly Met Glu Val 40 45 50 gcc tcg tac ctg gtg gct cag tat ggg gag cag cgg gcc tgg gac cta 366 Ala Ser Tyr Leu Val Ala Gln Tyr Gly Glu Gln Arg Ala Trp Asp Leu 55 60 65 gcc ctc cat acc tgg gag cag atg ggg ctg agg tca ctg tgc gcc caa 414 Ala Leu His Thr Trp Glu Gln Met Gly Leu Arg Ser Leu Cys Ala Gln 70 75 80 85 gcc cag gaa ggg gca ggc cac tct ccc tca ttc ccc tac agc cca agt 462 Ala Gln Glu Gly Ala Gly His Ser Pro Ser Phe Pro Tyr Ser Pro Ser 90 95 100 gaa ccc cac ctg ggg tct ccc agc caa ccc acc tcc acc gca gtg cta 510 Glu Pro His Leu Gly Ser Pro Ser Gln Pro Thr Ser Thr Ala Val Leu 105 110 115 atg ccc tgg atc cat gaa ttg ccg gcg ggg tgc acc cag ggc tca gag 558 Met Pro Trp Ile His Glu Leu Pro Ala Gly Cys Thr Gln Gly Ser Glu 120 125 130 aga agg gtt ttg aga cag ctg cct gac aca tct gga cgc cgc tgg aga 606 Arg Arg Val Leu Arg Gln Leu Pro Asp Thr Ser Gly Arg Arg Trp Arg 135 140 145 gaa atc tct gcc tca cac ctc tac caa gct ctt cca agc tcc cca gac 654 Glu Ile Ser Ala Ser His Leu Tyr Gln Ala Leu Pro Ser Ser Pro Asp 150 155 160 165 cat gag tct cca agc cag gag tca ccc aac gcc ccc aca tcc aca gca 702 His Glu Ser Pro Ser Gln Glu Ser Pro Asn Ala Pro Thr Ser Thr Ala 170 175 180 gtg ctg ggg agc tgg gga tcc cca cct cag ccc agc cta gca ccc aga 750 Val Leu Gly Ser Trp Gly Ser Pro Pro Gln Pro Ser Leu Ala Pro Arg 185 190 195 gag cag gag gct cct ggg acc caa tgg cct ctg gat gaa acg tca gga 798 Glu Gln Glu Ala Pro Gly Thr Gln Trp Pro Leu Asp Glu Thr Ser Gly 200 205 210 att tac tac aca gaa atc aga gaa aga gag aga gag aaa tca gag aaa 846 Ile Tyr Tyr Thr Glu Ile Arg Glu Arg Glu Arg Glu Lys Ser Glu Lys 215 220 225 ggc agg ccc cca tgg gca gcg gtg gta gga acg ccc cca cag gcg cac 894 Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr Pro Pro Gln Ala His 230 235 240 245 agc agc cta cag ccc cac cac cac cca tgg gag cct tct gtg aga gag 942 Ser Ser Leu Gln Pro His His His Pro Trp Glu Pro Ser Val Arg Glu 250 255 260 agc ctc tgt tcc aca tgg ccc tgg aaa aat gag gat ttt aac caa aaa 990 Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu Asp Phe Asn Gln Lys 265 270 275 ttc aca cag ctg cta ctt cta caa aga cct cac ccc aga agc caa gat 1038 Phe Thr Gln Leu Leu Leu Leu Gln Arg Pro His Pro Arg Ser Gln Asp 280 285 290 ccc ctg gtc aag aga agc tgg cct gat tat gtg gag gag aat cga gga 1086 Pro Leu Val Lys Arg Ser Trp Pro Asp Tyr Val Glu Glu Asn Arg Gly 295 300 305 cat tta att gag atc aga gac tta ttt ggc cca ggc ctg gat acc caa 1134 His Leu Ile Glu Ile Arg Asp Leu Phe Gly Pro Gly Leu Asp Thr Gln 310 315 320 325 gaa cct cgc ata gtc ata ctg cag ggg gct gct gga att ggg aag tca 1182 Glu Pro Arg Ile Val Ile Leu Gln Gly Ala Ala Gly Ile Gly Lys Ser 330 335 340 aca ctg gcc agg cag gtg aag gaa gcc tgg ggg aga ggc cag ctg tat 1230 Thr Leu Ala Arg Gln Val Lys Glu Ala Trp Gly Arg Gly Gln Leu Tyr 345 350 355 ggg gac cgc ttc cag cat gtc ttc tac ttc agc tgc aga gag ctg gcc 1278 Gly Asp Arg Phe Gln His Val Phe Tyr Phe Ser Cys Arg Glu Leu Ala 360 365 370 cag tcc aag gtg gtg agt ctc gct gag ctc atc gga aaa gat ggg aca 1326 Gln Ser Lys Val Val Ser Leu Ala Glu Leu Ile Gly Lys Asp Gly Thr 375 380 385 gcc act ccg gct ccc att aga cag atc ctg tct agg cca gag cgg ctg 1374 Ala Thr Pro Ala Pro Ile Arg Gln Ile Leu Ser Arg Pro Glu Arg Leu 390 395 400 405 ctc ttc atc ctc gat ggt gta gat gag cca gga tgg gtc ttg cag gag 1422 Leu Phe Ile Leu Asp Gly Val Asp Glu Pro Gly Trp Val Leu Gln Glu 410 415 420 ccg agt tct gag ctc tgt ctg cac tgg agc cag cca cag ccg gcg gat 1470 Pro Ser Ser Glu Leu Cys Leu His Trp Ser Gln Pro Gln Pro Ala Asp 425 430 435 gca ctg ctg ggc agt ttg ctg ggg aaa act ata ctt ccc gag gca tcc 1518 Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr Ile Leu Pro Glu Ala Ser 440 445 450 ttt ctg atc acg gct cgg acc aca gct ctg cag aac ctc att cct tct 1566 Phe Leu Ile Thr Ala Arg Thr Thr Ala Leu Gln Asn Leu Ile Pro Ser 455 460 465 ttg gag cag gca cgt tgg gta gag gtc ctg ggg ttc tct gag tcc agc 1614 Leu Glu Gln Ala Arg Trp Val Glu Val Leu Gly Phe Ser Glu Ser Ser 470 475 480 485 agg aag gaa tat ttc tac aga tat ttc aca gat gaa agg caa gca att 1662 Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp Glu Arg Gln Ala Ile 490 495 500 aga gcc ttt agg ttg gtc aaa tca aac aaa gag ctc tgg gcc ctg tgt 1710 Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu Leu Trp Ala Leu Cys 505 510 515 ctt gtg ccc tgg gtg tcc tgg ctg gcc tgc act tgc ctg atg cag cag 1758 Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr Cys Leu Met Gln Gln 520 525 530 atg aag cgg aag gaa aaa ctc aca ctg act tcc aag acc acc aca acc 1806 Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser Lys Thr Thr Thr Thr 535 540 545 ctc tgt cta cat tac ctt gcc cag gct ctc caa gct cag cca ttg gga 1854 Leu Cys Leu His Tyr Leu Ala Gln Ala Leu Gln Ala Gln Pro Leu Gly 550 555 560 565 ccc cag ctc aga gac ctc tgc tct ctg gct gct gag ggc atc tgg caa 1902 Pro Gln Leu Arg Asp Leu Cys Ser Leu Ala Ala Glu Gly Ile Trp Gln 570 575 580 aaa aag acc ctt ttc agt cca gat gac ctc agg aag cat ggg tta gat 1950 Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg Lys His Gly Leu Asp 585 590 595 ggg gcc atc atc tcc acc ttc ttg aag atg ggt att ctt caa gag cac 1998 Gly Ala Ile Ile Ser Thr Phe Leu Lys Met Gly Ile Leu Gln Glu His 600 605 610 ccc atc cct ctg agc tac agc ttc att cac ctc tgt ttc cag gag ttc 2046 Pro Ile Pro Leu Ser Tyr Ser Phe Ile His Leu Cys Phe Gln Glu Phe 615 620 625 ttt gca gca atg tcc tat gtc ttg gag gat gag aag ggg aga ggt aaa 2094 Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu Lys Gly Arg Gly Lys 630 635 640 645 cat tct aat tgc atc ata gat ttg gaa aag acg cta gaa gca tat gga 2142 His Ser Asn Cys Ile Ile Asp Leu Glu Lys Thr Leu Glu Ala Tyr Gly 650 655 660 ata cat ggc ctg ttt ggg gca tca acc aca cgt ttc cta ttg ggc ctg 2190 Ile His Gly Leu Phe Gly Ala Ser Thr Thr Arg Phe Leu Leu Gly Leu 665 670 675 tta agt gat gag ggg gag aga gag atg gag aac atc ttt cac tgc cgg 2238 Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn Ile Phe His Cys Arg 680 685 690 ctg tct cag ggg agg aac ctg atg cag tgg gtc ccg tcc ctt cag ctg 2286 Leu Ser Gln Gly Arg Asn Leu Met Gln Trp Val Pro Ser Leu Gln Leu 695 700 705 ctg ctg cag cca cac tct ctg gag tcc ctc cac tgc ttg tat gag act 2334 Leu Leu Gln Pro His Ser Leu Glu Ser Leu His Cys Leu Tyr Glu Thr 710 715 720 725 cgg aac aaa acg ttc ctg aca caa gtg atg gcc cat ttc gaa gaa atg 2382 Arg Asn Lys Thr Phe Leu Thr Gln Val Met Ala His Phe Glu Glu Met 730 735 740 ggc atg tgt gta gaa aca gac atg gag ctc tta gtg tgc act ttc tgc 2430 Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu Val Cys Thr Phe Cys 745 750 755 att aaa ttc agc cgc cac gtg aag aag ctt cag ctg att gag ggc agg 2478 Ile Lys Phe Ser Arg His Val Lys Lys Leu Gln Leu Ile Glu Gly Arg 760 765 770 cag cac aga tca aca tgg agc ccc agc atg gta gtc ctg ttc agg tgg 2526 Gln His Arg Ser Thr Trp Ser Pro Ser Met Val Val Leu Phe Arg Trp 775 780 785 gtc cca gtc aca gat gcc tat tgg cag att ctc ttc tcc gtc ctc aag 2574 Val Pro Val Thr Asp Ala Tyr Trp Gln Ile Leu Phe Ser Val Leu Lys 790 795 800 805 gtc acc aga aac ctg aag gag ctg gac cta agt gga aac tcg ctg agc 2622 Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser Gly Asn Ser Leu Ser 810 815 820 cac tct gca gtg aag agt ctt tgt aag acc ctg aga cgc cct cgc tgc 2670 His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu Arg Arg Pro Arg Cys 825 830 835 ctc ctg gag acc ctg cgg ttg gct ggc tgt ggc ctc aca gct gag gac 2718 Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly Leu Thr Ala Glu Asp 840 845 850 tgt aag gac ctt gcc ttt ggg ctg aga gcc aac cag acc ctg acc gag 2766 Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn Gln Thr Leu Thr Glu 855 860 865 ctg gac ctg agc ttc aat gtg ctc atg gat gct gga gcc aaa cac ctt 2814 Leu Asp Leu Ser Phe Asn Val Leu Met Asp Ala Gly Ala Lys His Leu 870 875 880 885 tgc cag aga ctg aga cag ccg agc tgc aag cta cag cga ctg cag ctg 2862 Cys Gln Arg Leu Arg Gln Pro Ser Cys Lys Leu Gln Arg Leu Gln Leu 890 895 900 gtc agc tgt ggc ctc acg tct gac tgc tgc cag gac ctg gcc tct gtg 2910 Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gln Asp Leu Ala Ser Val 905 910 915 ctt agt gcc agc ccc agc ctg aag gag cta gac ctg cag cag aac aac 2958 Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp Leu Gln Gln Asn Asn 920 925 930 ctg gat gac gtt ggc gtg cga ctg ctc tgt gag ggg ctc agg cat cct 3006 Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu Gly Leu Arg His Pro 935 940 945 gcc tgc aaa ctc ata cgc ctg ggg ctg gac cag acg act ctg agt gat 3054 Ala Cys Lys Leu Ile Arg Leu Gly Leu Asp Gln Thr Thr Leu Ser Asp 950 955 960 965 gag atg agg cag gag ctg agg gcc ctg gag cag gag aag cct cag ctg 3102 Glu Met Arg Gln Glu Leu Arg Ala Leu Glu Gln Glu Lys Pro Gln Leu 970 975 980 ctc atc ttc agc aga cgg aaa cca agt gtg atg acc cct att gag ggc 3150 Leu Ile Phe Ser Arg Arg Lys Pro Ser Val Met Thr Pro Ile Glu Gly 985 990 995 ctg gat acg gga gag atg agt aat agc aca tcc tca ctc aag cgg cag 3198 Leu Asp Thr Gly Glu Met Ser Asn Ser Thr Ser Ser Leu Lys Arg Gln 1000 1005 1010 aga ctc gga tca gag agg gcg gct tcc cat gtt gct cag gct aat ctc 3246 Arg Leu Gly Ser Glu Arg Ala Ala Ser His Val Ala Gln Ala Asn Leu 1015 1020 1025 aaa ctc ctg gac gtg agc aag atc ttc cca att gct gag att gca gag 3294 Lys Leu Leu Asp Val Ser Lys Ile Phe Pro Ile Ala Glu Ile Ala Glu 1030 1035 1040 1045 gaa agc tcc cca gag gta gta ccg gtg gaa ctc ttg tgc gtg cct tct 3342 Glu Ser Ser Pro Glu Val Val Pro Val Glu Leu Leu Cys Val Pro Ser 1050 1055 1060 cct gcc tct caa ggg gac ctg cat acg aag cct ttg ggg act gac gat 3390 Pro Ala Ser Gln Gly Asp Leu His Thr Lys Pro Leu Gly Thr Asp Asp 1065 1070 1075 gac ttc tgg ggc ccc acg ggg cct gtg gct act gag gta gtt gac aaa 3438 Asp Phe Trp Gly Pro Thr Gly Pro Val Ala Thr Glu Val Val Asp Lys 1080 1085 1090 gaa aag aac ttg tac cga gtt cac ttc cct gta gct ggc tcc tac cgc 3486 Glu Lys Asn Leu Tyr Arg Val His Phe Pro Val Ala Gly Ser Tyr Arg 1095 1100 1105 tgg ccc aac acg ggt ctc tgc ttt gtg gtg aga gaa gcg gtg acc gtt 3534 Trp Pro Asn Thr Gly Leu Cys Phe Val Val Arg Glu Ala Val Thr Val 1110 1115 1120 1125 gag att gaa ttc tgt gtg tgg gac cag ttc ctg ggt gag atc aac cca 3582 Glu Ile Glu Phe Cys Val Trp Asp Gln Phe Leu Gly Glu Ile Asn Pro 1130 1135 1140 cag cac agc tgg atg gtg gca ggg cct ctg ctg gac atc aag gct gag 3630 Gln His Ser Trp Met Val Ala Gly Pro Leu Leu Asp Ile Lys Ala Glu 1145 1150 1155 cct gga gcc gtg gaa gct gtg cac ctc cct cac ttt gtg gct ctc caa 3678 Pro Gly Ala Val Glu Ala Val His Leu Pro His Phe Val Ala Leu Gln 1160 1165 1170 ggg ggc cat gtg gac aca tcc ctg ttc caa gtg gcc cac ttt aaa gag 3726 Gly Gly His Val Asp Thr Ser Leu Phe Gln Val Ala His Phe Lys Glu 1175 1180 1185 gag ggg atg ctc ctg gag aag cca gcc agg gtg gag ctg cat cac ata 3774 Glu Gly Met Leu Leu Glu Lys Pro Ala Arg Val Glu Leu His His Ile 1190 1195 1200 1205 gtt ctg gaa aac ccc agc ttt tcc ccc ttg gga gtc ctc ctg aaa atg 3822 Val Leu Glu Asn Pro Ser Phe Ser Pro Leu Gly Val Leu Leu Lys Met 1210 1215 1220 atc cat aat gcc ctg cgc ttc att ccc gtc acc tct gtg gtg ttg ctt 3870 Ile His Asn Ala Leu Arg Phe Ile Pro Val Thr Ser Val Val Leu Leu 1225 1230 1235 tac cac cgc ctc cat cct gag gaa gtc acc ttc cac ctc tac ctg atc 3918 Tyr His Arg Leu His Pro Glu Glu Val Thr Phe His Leu Tyr Leu Ile 1240 1245 1250 cca agt gac tgc tcc att cgg aag gaa ctg gag ctc tgc tat cga agc 3966 Pro Ser Asp Cys Ser Ile Arg Lys Glu Leu Glu Leu Cys Tyr Arg Ser 1255 1260 1265 cct gga gaa gac cag ctg ttc tcg gag ttc tac gtt ggc cac ttg gga 4014 Pro Gly Glu Asp Gln Leu Phe Ser Glu Phe Tyr Val Gly His Leu Gly 1270 1275 1280 1285 tca ggg atc agg ctg caa gtg aaa gac aag aaa gat gag act ctg gtg 4062 Ser Gly Ile Arg Leu Gln Val Lys Asp Lys Lys Asp Glu Thr Leu Val 1290 1295 1300 tgg gag gcc ttg gtg aaa cca gga gat ctc atg cct gca act act ctg 4110 Trp Glu Ala Leu Val Lys Pro Gly Asp Leu Met Pro Ala Thr Thr Leu 1305 1310 1315 atc cct cca gcc tgc ata gcc gta cct tca cct ctg gat gcc ccg cag 4158 Ile Pro Pro Ala Cys Ile Ala Val Pro Ser Pro Leu Asp Ala Pro Gln 1320 1325 1330 ttg ctg cac ttt gtg gac cag tat cga gag cag ctg ata gcc cga gtg 4206 Leu Leu His Phe Val Asp Gln Tyr Arg Glu Gln Leu Ile Ala Arg Val 1335 1340 1345 aca tcg gtg gag gtt gtc ttg gac aaa ctg cat gga cag gtg ctg agc 4254 Thr Ser Val Glu Val Val Leu Asp Lys Leu His Gly Gln Val Leu Ser 1350 1355 1360 1365 cag gag cag tac gag agg gtg ctg gct gag aac acg agg ccc agc cag 4302 Gln Glu Gln Tyr Glu Arg Val Leu Ala Glu Asn Thr Arg Pro Ser Gln 1370 1375 1380 atg cgg aag ctg ttc agc ttg agc cag tcc tgg gac cgg aag tgc aaa 4350 Met Arg Lys Leu Phe Ser Leu Ser Gln Ser Trp Asp Arg Lys Cys Lys 1385 1390 1395 gat gga ctc tac caa gcc ctg aag gag acc cat cct cac ctc att atg 4398 Asp Gly Leu Tyr Gln Ala Leu Lys Glu Thr His Pro His Leu Ile Met 1400 1405 1410 gaa ctc tgg gag aag ggc agc aaa aag gga ctc ctg cca ctc agc agc 4446 Glu Leu Trp Glu Lys Gly Ser Lys Lys Gly Leu Leu Pro Leu Ser Ser 1415 1420 1425 tga agtatgaaca ccagcccttg acccttgagt cctggctttg gctgaccctt 4499 ccttgggtct cagtttcttt ctctgcaaac aagttgccat ctggtttgcc ttccagcact 4559 aaagtaatgg aactttgatg atgcctttgc tgggcattat gtgtccatgc cagggatgcc 4619 acagggggcc ccagtccagg tggcctaaca gcatctcagg gaatgtccat ctggagctgg 4679 caagacccct gcagacctca tagagcctca tctggtggcc acagcagcca agcctagagc 4739 cctccggatc ccatccaggc gcaaagagga ataggaggga catggaacca tttgcctctg 4799 gctgtgtcac agggtgagcc ccaaaattgg ggttcagcgt gggaggccac gtggattctt 4859 ggctttgtac aggaagatct acaagagcaa gccaacagag taaagtggaa ggaagtttat 4919 tcagaaaata aaggagtatc actgctcttt tagaatttgt ctagcagact ttccagtttt 4979 taccagaaaa cccctataaa ttaaaaattt tttacttaaa tttaagaatt aaaaaaatac 5039 aaaaaagaaa aaatgaaaat aaaggaataa gaagttaaaa aaaaaaaaaa aaaaaaaaaa 5099 a 5100 13 763 DNA Homo sapiens unsure 45 unknown 13 ctttctcccc cttgggagtc ttcctgaaaa tgatccataa tgccntgcgc ttcattcccg 60 tcacctctgt ggtgttgctt taccaccgcg tccatcctga gaaagtcacc ttccacctct 120 acctgatccc aagtgactgc tccattcgga aggccataga tgatctagaa atgaaattcc 180 agtttgtgcg aatccacaag ccacccccgc tgaccccact ttatatgggc tgtcgttaca 240 ctgtgtctgg gtctggttca gggatgctgg aaatactccc caaggaactg gagctctgct 300 atcgaagccc tggagaagac cagctgttct cggagttcta cgttggccac ttgggatcag 360 ggatcaggct gcaagtgaaa gacaagaaag atgagactct ggtgtgggag gccttggtga 420 aaccaggaga tctcatgcct gcaactactc tgatccctcc agcccgcata ggaaggaaca 480 ccagccagcc gtggaacctc aggtgcaaca gagacgccag gagatactag tgcccagcag 540 cctgcggcag taccaatgaa gccagagagg gcttggtgga tgacaaggag gcctgagtag 600 accgcaggtg ggtctgagaa atgggcttag gtgaggcagg tctttgaagg atttgttctt 660 aatcatatgc gagatgctca aaaggctgga tgcctgcttt tgtgggtgaa gagcaagaag 720 agaaaacagg ttgtacacat acagatgcag atggagagac aga 763 14 20 DNA Artificial Sequence Antisense Oligonucleotide 14 ggaatgggct ttcagaaccc 20 15 20 DNA Artificial Sequence Antisense Oligonucleotide 15 gagccgcaga gcagggaatg 20 16 20 DNA Artificial Sequence Antisense Oligonucleotide 16 ctgttctgtg ttcctcagat 20 17 20 DNA Artificial Sequence Antisense Oligonucleotide 17 gggagtggta ggaaaagcca 20 18 20 DNA Artificial Sequence Antisense Oligonucleotide 18 ccgtctctta ttcagcattc 20 19 20 DNA Artificial Sequence Antisense Oligonucleotide 19 gcaggcagag aacagtgctg 20 20 20 DNA Artificial Sequence Antisense Oligonucleotide 20 ctctgtcccg gagttaagag 20 21 20 DNA Artificial Sequence Antisense Oligonucleotide 21 aagctggaac tccttcagct 20 22 20 DNA Artificial Sequence Antisense Oligonucleotide 22 ggagggctag gtcccaggcc 20 23 20 DNA Artificial Sequence Antisense Oligonucleotide 23 cccgccggca attcatggat 20 24 20 DNA Artificial Sequence Antisense Oligonucleotide 24 aaacccttct ctctgagccc 20 25 20 DNA Artificial Sequence Antisense Oligonucleotide 25 gcagagattt ctctccagcg 20 26 20 DNA Artificial Sequence Antisense Oligonucleotide 26 ccattgggtc ccaggagcct 20 27 20 DNA Artificial Sequence Antisense Oligonucleotide 27 ggttaaaatc ctcatttttc 20 28 20 DNA Artificial Sequence Antisense Oligonucleotide 28 gcagctgaag tagaagacat 20 29 20 DNA Artificial Sequence Antisense Oligonucleotide 29 ctgggccagc tctctgcagc 20 30 20 DNA Artificial Sequence Antisense Oligonucleotide 30 tgatcaggaa ggatgcctcg 20 31 20 DNA Artificial Sequence Antisense Oligonucleotide 31 acaggccatg tattccatat 20 32 20 DNA Artificial Sequence Antisense Oligonucleotide 32 gagtctcgta caagcagtgg 20 33 20 DNA Artificial Sequence Antisense Oligonucleotide 33 ccacctgaac aggactacca 20 34 20 DNA Artificial Sequence Antisense Oligonucleotide 34 aagagaatct gccaataggc 20 35 20 DNA Artificial Sequence Antisense Oligonucleotide 35 tttctggtga ccttgaggac 20 36 20 DNA Artificial Sequence Antisense Oligonucleotide 36 ggtcttacaa agactcttca 20 37 20 DNA Artificial Sequence Antisense Oligonucleotide 37 gctgtctcag tctctggcaa 20 38 20 DNA Artificial Sequence Antisense Oligonucleotide 38 gctgaccagc tgcagtcgct 20 39 20 DNA Artificial Sequence Antisense Oligonucleotide 39 ctggtccagc cccaggcgta 20 40 20 DNA Artificial Sequence Antisense Oligonucleotide 40 tcctgcctca tctcatcact 20 41 20 DNA Artificial Sequence Antisense Oligonucleotide 41 cctcagttcc tgcctcatct 20 42 20 DNA Artificial Sequence Antisense Oligonucleotide 42 ccagggccct cagttcctgc 20 43 20 DNA Artificial Sequence Antisense Oligonucleotide 43 acttggtttc cgtctgctga 20 44 20 DNA Artificial Sequence Antisense Oligonucleotide 44 ctctgccgct tgagtgagga 20 45 20 DNA Artificial Sequence Antisense Oligonucleotide 45 cccttgagag gcaggagaag 20 46 20 DNA Artificial Sequence Antisense Oligonucleotide 46 gacccgtgtt gggccagcgg 20 47 20 DNA Artificial Sequence Antisense Oligonucleotide 47 gtcaccgctt ctctcatcac 20 48 20 DNA Artificial Sequence Antisense Oligonucleotide 48 caaagtgagg gaggtgcaca 20 49 20 DNA Artificial Sequence Antisense Oligonucleotide 49 acatggcccc cttggagagc 20 50 20 DNA Artificial Sequence Antisense Oligonucleotide 50 ggctggcttc tccaggagca 20 51 20 DNA Artificial Sequence Antisense Oligonucleotide 51 gcggtggtaa agcaacacca 20 52 20 DNA Artificial Sequence Antisense Oligonucleotide 52 gctccagttc cttccgaatg 20 53 20 DNA Artificial Sequence Antisense Oligonucleotide 53 tcgatagcag agctccagtt 20 54 20 DNA Artificial Sequence Antisense Oligonucleotide 54 accagagtct catctttctt 20 55 20 DNA Artificial Sequence Antisense Oligonucleotide 55 tctcctggtt tcaccaaggc 20 56 20 DNA Artificial Sequence Antisense Oligonucleotide 56 atgagatctc ctggtttcac 20 57 20 DNA Artificial Sequence Antisense Oligonucleotide 57 gaaggtacgg ctatgcgggc 20 58 20 DNA Artificial Sequence Antisense Oligonucleotide 58 aacctccacc gatgtcactc 20 59 20 DNA Artificial Sequence Antisense Oligonucleotide 59 tgtccaagac aacctccacc 20 60 20 DNA Artificial Sequence Antisense Oligonucleotide 60 gtccatcttt gcacttccgg 20 61 20 DNA Artificial Sequence Antisense Oligonucleotide 61 agttccataa tgaggtgagg 20 62 20 DNA Artificial Sequence Antisense Oligonucleotide 62 ctgctgagtg gcaggagtcc 20 63 20 DNA Artificial Sequence Antisense Oligonucleotide 63 ttgatacttc agctgctgag 20 64 20 DNA Artificial Sequence Antisense Oligonucleotide 64 aaagccagga ctcaagggtc 20 65 20 DNA Artificial Sequence Antisense Oligonucleotide 65 actttagtgc tggaaggcaa 20 66 20 DNA Artificial Sequence Antisense Oligonucleotide 66 ctggcatgga cacataatgc 20 67 20 DNA Artificial Sequence Antisense Oligonucleotide 67 gctccagatg gacattccct 20 68 20 DNA Artificial Sequence Antisense Oligonucleotide 68 cttggctgct gtggccacca 20 69 20 DNA Artificial Sequence Antisense Oligonucleotide 69 aggcaaatgg ttccatgtcc 20 70 20 DNA Artificial Sequence Antisense Oligonucleotide 70 atacagcatg tagccttttt 20 71 20 DNA Artificial Sequence Antisense Oligonucleotide 71 cttcttctat tgcgcaatct 20 72 20 DNA Artificial Sequence Antisense Oligonucleotide 72 gccagccaac ctgtggaaga 20 73 20 DNA Artificial Sequence Antisense Oligonucleotide 73 gaacaattac tgcagtcgct 20 74 20 DNA Artificial Sequence Antisense Oligonucleotide 74 taccgtctgc tgaagatgag 20 75 20 DNA Artificial Sequence Antisense Oligonucleotide 75 tccaacttac cttcttgcta 20 76 20 DNA Artificial Sequence Antisense Oligonucleotide 76 ttagtgtcac cttccgaatg 20 77 20 DNA Artificial Sequence Antisense Oligonucleotide 77 catctatggc cttccgaatg 20 78 20 DNA Artificial Sequence Antisense Oligonucleotide 78 gaatttcatt tctagatcat 20 79 20 DNA Artificial Sequence Antisense Oligonucleotide 79 gattcgcaca aactggaatt 20 80 20 DNA Artificial Sequence Antisense Oligonucleotide 80 agtgtaacga cagcccatat 20 81 20 DNA Artificial Sequence Antisense Oligonucleotide 81 caaagctcca gttccttggg 20 82 20 DNA Artificial Sequence Antisense Oligonucleotide 82 gtctctgttg cacctgaggt 20 83 20 DNA Artificial Sequence Antisense Oligonucleotide 83 attggtactg ccgcaggctg 20 84 20 DNA Artificial Sequence Antisense Oligonucleotide 84 ctggcttcat tggtactgcc 20 85 20 DNA Artificial Sequence Antisense Oligonucleotide 85 catctcgcat atgattaaga 20 86 20 DNA Artificial Sequence Antisense Oligonucleotide 86 catctgtatg tgtacaacct 20 87 20 DNA Artificial Sequence Antisense Oligonucleotide 87 ccttttaaaa gaaatattta 20 88 20 DNA Artificial Sequence Antisense Oligonucleotide 88 ctcggcagtc tctgcgcagc 20 89 20 DNA Artificial Sequence Antisense Oligonucleotide 89 ggtccgccgg tcctctcagc 20 90 20 DNA Artificial Sequence Antisense Oligonucleotide 90 ccagccatct tgttctgtgt 20 91 20 DNA Artificial Sequence Antisense Oligonucleotide 91 ggtgttcctt cctatgcggg 20 

What is claimed is:
 1. A compound 8 to 50 nucleobases in length targeted to a nucleic acid molecule encoding NAC, wherein said compound specifically hybridizes with said nucleic acid molecule encoding NAC and inhibits the expression of NAC.
 2. The compound of claim 1 which is an antisense oligonucleotide.
 3. The compound of claim 2 wherein the antisense oligonucleotide has a sequence comprising SEQ ID NO: 14, 16, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 34, 35, 36, 37, 40, 42, 44, 45, 47, 48, 50, 51, 53, 54, 55, 60, 64, 65, 66, 67, 68, 69, 70, 72, 73, 74, 78, 79, 80, 82, 86, 88 or
 90. 4. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified internucleoside linkage.
 5. The compound of claim 4 wherein the modified internucleoside linkage is a phosphorothioate linkage.
 6. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified sugar moiety.
 7. The compound of claim 6 wherein the modified sugar moiety is a 2′-O-methoxyethyl sugar moiety.
 8. The compound of claim 2 wherein the antisense oligonucleotide comprises at least one modified nucleobase.
 9. The compound of claim 8 wherein the modified nucleobase is a 5-methylcytosine.
 10. The compound of claim 2 wherein the antisense oligonucleotide is a chimeric oligonucleotide.
 11. A compound 8 to 50 nucleobases in length which specifically hybridizes with at least an 8-nucleobase portion of an active site on a nucleic acid molecule encoding NAC.
 12. A composition comprising the compound of claim 1 and a pharmaceutically acceptable carrier or diluent.
 13. The composition of claim 12 further comprising a colloidal dispersion system.
 14. The composition of claim 12 wherein the compound is an antisense oligonucleotide.
 15. A method of inhibiting the expression of NAC in cells or tissues comprising contacting said cells or tissues with the compound of claim 1 so that expression of NAC is inhibited.
 16. A method of treating an animal having a disease or condition associated with NAC comprising administering to said animal a therapeutically or prophylactically effective amount of the compound of claim 1 so that expression of NAC is inhibited.
 17. The method of claim 16 wherein the disease or condition is a hyperproliferative disease.
 18. The method of claim 16 wherein the disease or disorder arises from aberrant apoptosis.
 19. The method of claim 16 wherein the disease or disorder is a neurological disease.
 20. The compound of claim 1 targeted to a nucleic acid molecule encoding NAC, wherein said compound specifically hybridizes with and inhibits the expression of an alternatively spliced variant of NAC. 